HTML Form Content Extractor
Hello all, I am trying to extract the labels associated with form elements (radio buttons, text boxes, etc.), but I cannot find a reliable way to do it, even through the DOM. As far as I can tell, it depends entirely on heuristics (e.g., the distance between the form element and the label, or the position of the label relative to the form element). Is there any way to extract the labels associated with form elements, or is there any software available that does this?
Also, HTML 4 and HTML 5 have a "label" tag, but it is not widely used, so I cannot rely on it either.
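To make the problem concrete, here is a rough sketch of the kind of approach I have in mind, using only Python's standard html.parser module. It uses a "label" tag when one exists (either with a "for" attribute or wrapped around the field) and otherwise falls back to a crude heuristic: the nearest preceding text for text boxes, and the nearest following text for radio buttons and checkboxes. The names LabelGuesser, extract_labels and SAMPLE are made up for illustration, and the heuristic is only an assumption about typical page layouts. It knows nothing about the rendered position of elements, so it will guess wrong on table-based or CSS-positioned forms.

```python
from html.parser import HTMLParser

FIELD_TAGS = {"input", "select", "textarea"}
SKIPPED_TYPES = {"hidden", "submit", "button", "reset", "image"}
TEXT_AFTER_TYPES = {"radio", "checkbox"}  # assumption: their caption follows them


class LabelGuesser(HTMLParser):
    """Hypothetical helper: collects form fields plus candidate label text."""

    def __init__(self):
        super().__init__()
        self.fields = []        # one dict per field, in document order
        self.for_labels = {}    # value of "for" -> text of that <label>
        self.in_label = False
        self.label_for = None
        self.label_text = []    # text pieces inside the open <label>
        self.label_fields = []  # fields nested inside the open <label>
        self.last_text = ""     # most recent loose text before a field
        self.pending = None     # radio/checkbox waiting for following text

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "label":
            self.in_label = True
            self.label_for = attrs.get("for")
            self.label_text, self.label_fields = [], []
            return
        if tag not in FIELD_TAGS:
            return
        kind = (attrs.get("type") or "text").lower()
        if kind in SKIPPED_TYPES:
            return
        field = {"name": attrs.get("name"), "id": attrs.get("id"),
                 "wrapped": None, "guess": None}
        self.fields.append(field)
        self.pending = None
        if self.in_label:
            self.label_fields.append(field)
        elif kind in TEXT_AFTER_TYPES:
            self.pending = field
        else:
            field["guess"] = self.last_text or None
        self.last_text = ""

    def handle_data(self, data):
        text = data.strip()
        if not text:
            return
        if self.in_label:
            self.label_text.append(text)
        elif self.pending is not None:
            self.pending["guess"] = text
            self.pending = None
        else:
            self.last_text = text

    def handle_endtag(self, tag):
        if tag == "label" and self.in_label:
            text = " ".join(self.label_text)
            if self.label_for:
                self.for_labels[self.label_for] = text
            for field in self.label_fields:
                field["wrapped"] = text
            self.in_label = False


def extract_labels(html):
    """Return (field name, label text, source) tuples in document order."""
    parser = LabelGuesser()
    parser.feed(html)
    parser.close()
    result = []
    for field in parser.fields:
        explicit = parser.for_labels.get(field["id"]) or field["wrapped"]
        if explicit:
            result.append((field["name"], explicit, "label"))
        else:
            result.append((field["name"], field["guess"], "heuristic"))
    return result


SAMPLE = """<form>
<label for="u">User name</label><input type="text" id="u" name="user">
Email: <input type="text" name="email">
<input type="radio" name="sex" value="m"> Male
<input type="radio" name="sex" value="f"> Female
<label><input type="checkbox" name="news"> Newsletter</label>
</form>"""

if __name__ == "__main__":
    for name, label, source in extract_labels(SAMPLE):
        print(name, "|", label, "|", source)
```

The third item in each tuple says whether the text came from a real "label" tag or from the fallback guess, so the guessed ones can be reviewed separately. What I am missing is something better than this fallback.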
Kindly help me.