How to Identify Unsupported Table Formats for Parsing

This article will show and explain which PDF schedule table formats are not supported for Parspec's parsing capabilities.

Schedule Parser - Unsupported Table Types

  1. Rotated tables - 90, 180 or 270 degrees rotated or table cells with vertically aligned text
  2. Semi-bordered tables - columns separated by lines but rows not separated or rows separated by lines but columns not separated
  3. Tables with clouds/shapes
  4. Tables with cell text or shapes touching boundaries 
  5. Tables with shaded cells (our algorithm assumes table text and cell boundaries are black and background is white)
  6. Disjoint tables 
  7. Bordered tables where left, right, top or bottom cell boundaries are not selected due to bad user selection
  8. Borderless tables with close-by columns 

Examples 

1a. Rotated tables - 90, 180 or 270 degrees rotated 

1b. Tables with cell text vertically aligned 

2. Semi-bordered tables - columns separated by lines but rows not separated or rows separated by lines but columns not separated

3. Tables with clouds/shapes 

4. Tables with cell text or shapes touching cell boundaries 

5. Tables with shaded cells

6. Disjoint tables - left table disconnected from main table

7. Bordered tables where left, right, top or bottom cell boundaries are not selected due to bad user selection

8. Borderless tables with close-by columns