Extract Content

Examples

Extract page content in PDF syntax from book.pdf into out:

  1. pdfcpu extract -mode content book.pdf out
  2. extracting content from book.pdf into out ...
  3. cd out && ls
  4. -rwxr-xr-x 1 horstrutter staff 9.1K Mar 8 12:27 23_791.txt*
  5. -rwxr-xr-x 1 horstrutter staff 4.2K Mar 8 12:27 25_824.txt*
  6. -rwxr-xr-x 1 horstrutter staff 1.9K Mar 8 12:27 8_147.txt*
  7. -rwxr-xr-x 1 horstrutter staff 9.3K Mar 8 12:27 10_173.txt*
  8. -rwxr-xr-x 1 horstrutter staff 12K Mar 8 12:27 18_330.txt*
  9. -rwxr-xr-x 1 horstrutter staff 7.2K Mar 8 12:27 19_353.txt*
  10. cat 8_147.txt
  11. BT
  12. /P <</MCID 0 >>BDC
  13. /CS0 cs 0 0 0 scn
  14. /GS0 gs
  15. /TT0 1 Tf
  16. 12 0 0 12 306 708.96 Tm
  17. ( )Tj
  18. EMC
  19. /P <</MCID 1 >>BDC
  20. 0 -1.15 TD
  21. ( )Tj
  22. EMC
  23. ET
  24. /InlineShape <</MCID 2 >>BDC
  25. q
  26. 107.94 692.52 396.12 -153.84 re
  27. W* n
  28. q
  29. /GS1 gs
  30. 396.1199951 0 0 153.7200012 107.9400024 538.740097 cm
  31. /Im0 Do
  32. Q
  33. etc..