This article will show and explain which PDF schedule table formats are not supported for Parspec's parsing capabilities.
Schedule Parser - Unsupported Table Types
- Rotated tables - 90, 180 or 270 degrees rotated or table cells with vertically aligned text
- Semi-bordered tables - columns separated by lines but rows not separated or rows separated by lines but columns not separated
- Tables with clouds/shapes
- Tables with cell text or shapes touching boundaries
- Tables with shaded cells (our algorithm assumes table text and cell boundaries are black and background is white)
- Disjoint tables
- Bordered tables where left, right, top or bottom cell boundaries are not selected due to bad user selection
- Borderless tables with close-by columns
Examples
1a. Rotated tables - 90, 180 or 270 degrees rotated
1b. Tables with cell text vertically aligned
2. Semi-bordered tables - columns separated by lines but rows not separated or rows separated by lines but columns not separated
3. Tables with clouds/shapes
4. Tables with cell text or shapes touching cell boundaries
5. Tables with shaded cells
6. Disjoint tables - left table disconnected from main table
7. Bordered tables where left, right, top or bottom cell boundaries are not selected due to bad user selection
8. Borderless tables with close-by columns