You will need to parse each page and search its contents for the keyword. It can get tricky if you go into too much detail, such as disregarding HTML elements and client-side source.
There are several ways to read files using the IO package supplied by Java. I usually use the InputStreamReader or FileReader objects. A google search will warrant some official documentation.
Bookmarks