XML - regex - find/replace in 5,000 files - National Library
I work at the National Library in Slovenia on the IMPACT project, which digitises and OCRs books from the 19th century. The project aims to significantly improve access to historical text and to remove the barriers that stand in the way of the mass digitisation of European cultural heritage.
We also work with XML files; there are about 5,000 of them.
We are fixing some mistakes in them by find-and-replace with Text Crawler.
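For context, the same kind of batch find/replace could also be scripted. This is only a minimal Python sketch, assuming the files are UTF-8 and end in .xml; the function name `replace_in_xml_files` and the pattern in the usage note are hypothetical illustrations, not something taken from Text Crawler or from our actual files:

```python
import re
from pathlib import Path


def replace_in_xml_files(root, pattern, replacement, encoding="utf-8"):
    """Apply one regex substitution to every .xml file under root.

    Returns the list of files whose content actually changed.
    Assumption: all files share the same encoding (UTF-8 by default).
    """
    regex = re.compile(pattern)
    changed = []
    for path in sorted(Path(root).rglob("*.xml")):
        original = path.read_text(encoding=encoding)
        updated = regex.sub(replacement, original)
        # Write back only when something was replaced, so untouched
        # files keep their modification time.
        if updated != original:
            path.write_text(updated, encoding=encoding)
            changed.append(path)
    return changed
```

A call such as `replace_in_xml_files("some_folder", r'\s+bgcolor="[^"]*"', "")` would strip a (hypothetical) `bgcolor` attribute from every file under `some_folder`. It is worth running this on a copy of the files first, because the changes are written in place.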
We have already corrected some mistakes (background color, font color, etc.), but we can't find a simple regular expression for finding and replacing cases like this example: