I'm working with the Joomla! CMS and I'm trying to write a simple SEO script which converts keywords to links. It's easy enough, but I'm having an issue with it.
I have GeSHi highlighting throughout my articles using the Joomla! GeSHi library, and sometimes the highlighted code gets ruined with a big juicy href because a keyword appears inside the <pre xml:lang=""></pre> tags.
So basically, in Joomla! you've got $this->item->text, which is the full article text. I'm using this code for my SEO:
I would like this regular expression to be identical to what it is now, but to ignore all text that appears between <pre xml:lang="some_language"> and </pre>.
Can anyone help me out with the necessary expression?
1. I know there are Joomla! extensions that do auto-SEO, but the reason I decided to write my own code is because those extensions also replace unwanted segments of text with links.
2. These are not my actual keywords.
I think that you need to rethink doing this with regular expressions. Better to use DOMDocument or XMLParser. XMLPArser, which parses HTML as well, will be more work but will offer the more elegant solution in the end.
“The power of the Web is in its universality. Access by everyone regardless of disability is an essential aspect.”
—Tim Berners-Lee, W3C Director and inventor of the World Wide Web
I'm not sure I understand where you're coming from. I've just quickly Googled DOMDocument and XML Parser, but I don't have a clue what they are or do. Is this what you're refering to? http://php.net/manual/en/class.domdocument.php
If so, how can it help me? I've never heard of it before and the php.net page doesn't seem to have much explanation other than code examples in the comments.
Also, why is my method not a good way of adding links automatically? Seems simple enough from my perspective. I'm not trying to create a Joomla! add-on or extension, just quick, simple automated links to various pages and articles around my own site (not external links).