I want to make a page listing all of the photographs on my website along with their titles and alternative representations.
I already created a little program to locate and load every HTML file, but I am having trouble figuring out how to extract the src, title, and alt attributes from this HTML:
<img src="/image/johnny.jpg" title="my pet johnny" alt="pretty and adorable" />
The sequence of the tags may vary, and I need all of them, so I suppose this should be done with some regex, but I'm not really sure how to parse this elegantly (I could do it the hard char by char way, but that's awful).