Upcoming Games

exeter and plymouth 2004 TT test Tomorrow at 17:20 UTC waucott (One user signed up)
exeter and plymouth 2004 TT test 07/04/2025 at 17:50 UTC waucott

(UTC times)

Full list
Add a game

Forum

Basingstoke SX 2015-04-08 Yesterday at 18:11 - Gwasanaethau
Manchester South FAQ Yesterday at 15:01 - 9pN1SEAp
Hope Valley SX 2019-01-09 Yesterday at 14:40 - TylerE
How do I fix can't find <body> e... 31/03/2025 at 19:56 - 9pN1SEAp
Glazebrook East Jn & Siding 31/03/2025 at 17:19 - HST125Scorton
Hope Valley SX 2009-10-15 30/03/2025 at 10:10 - Hap
Cheshire Lines FAQ 29/03/2025 at 21:26 - 9pN1SEAp
Lincoln Street LC, Old Basford 29/03/2025 at 10:19 - MrSuttonmann
DOWNLOADS 29/03/2025 at 09:40 - den
1975 timetable q - where is cam... 29/03/2025 at 08:36 - clive
Profile options - Email Subscrip... 28/03/2025 at 20:15 - GeoffM
stopping postions 28/03/2025 at 19:32 - rodney30
'Cannot find timetable for [UID]... 28/03/2025 at 19:31 - GeoffM
Signal 733 at Trafford Park East... 28/03/2025 at 17:06 - Hap
Bug: Trains entering from Church... 28/03/2025 at 14:51 - eps125

Index
Latest posts

User

Log in
Register
What's my IP?
Search

Upcoming Events

No events to display

Who's Online

geswedey (1 users seen recently)

OCR of tables (e.g. WTTs)

You are here: Home > Forum > Miscellaneous > Open mic (non-railway) > OCR of tables (e.g. WTTs)

Page 1 of 1

1

Swipe the screen to the left to view more details

OCR of tables (e.g. WTTs) 12/01/2023 at 14:18 #150131
DonRiver 174 posts	Was wondering if anyone's had a go at using OCR to parse scanned timetables, e.g. those in Network Rail's archive? Just looking at Tesseract OCR's documentation (tesseract-ocr.github.io) - it's designed for reading paragraphs of text, not tables - wondering if there's off-the-shelf image processing techniques for recognising each column by its borders, cropping it out of the image, and OCR'ing it in isolation… it _might_ not actually be difficult in Python (named for the one in Tasmania, not in Russia) Log in to reply

OCR of tables (e.g. WTTs) 12/01/2023 at 16:08 #150132
bill_gensheet 1473 posts	No, but just tried to see how it would go: https://www.onlineocr.net/pdftoexcel Seemed quite good except for dealing with times ending ½ which went to % or 1/2. While fixing the % is easy, 11/221/2 is more complicated to get to 11/22 ½ However that was a 2015 file, which looked like it was printed to pdf rather than scanned. Log in to reply The following user said thank you: DonRiver

1