Search for text on specific lines and within specific columns

jimtell
Posts: 6
Joined: Tue May 22 2007

Post by jimtell » Tue Jun 05 2007

We need to be able to programmatically search PDF files for text that is on line x, y, or z and lies between columns a and b. Our current solution provides the location of each word in the document, which lets us define searches in a way that closely approximates columns and lines. We would like to find an alternative solution. Do any of your tools provide line and column information about the text within a PDF?
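To make the requirement concrete, here is a minimal sketch of how per-word locations can be turned into approximate line and column numbers. Everything in it is an assumption for illustration: the `Word` record, the `y_tolerance`, `left_margin` and `char_width` values, and the function names are hypothetical and do not come from any particular PDF library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    text: str
    x: float  # left edge of the word, in points (assumed unit)
    y: float  # vertical position measured from the top of the page (assumed)


def assign_lines(words, y_tolerance=2.0):
    """Group words into lines, top to bottom.

    A word joins the current line when its y differs from the first
    word of that line by at most y_tolerance.
    """
    lines = []
    for word in sorted(words, key=lambda w: (w.y, w.x)):
        if lines and abs(word.y - lines[-1][0].y) <= y_tolerance:
            lines[-1].append(word)
        else:
            lines.append([word])
    for line in lines:
        line.sort(key=lambda w: w.x)  # left-to-right order within a line
    return lines


def column_of(word, left_margin=72.0, char_width=6.0):
    """Approximate 1-based column, assuming a fixed-width character grid."""
    return int((word.x - left_margin) // char_width) + 1


def find_text(words, text, line_numbers, col_min, col_max):
    """Return (line, column) pairs where text starts on one of the given
    1-based lines, within the inclusive column range."""
    hits = []
    for line_no, line in enumerate(assign_lines(words), start=1):
        if line_no not in line_numbers:
            continue
        for word in line:
            col = column_of(word)
            if word.text == text and col_min <= col <= col_max:
                hits.append((line_no, col))
    return hits
```

For example, `find_text(words, "42", {2}, 30, 45)` looks for the word "42" on line 2 between columns 30 and 45. The fixed character width only approximates columns for proportional fonts, which is the limitation described above.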

Joan
Amyuni Team
Posts: 2799
Joined: Wed Sep 11 2002

Post by Joan » Fri Jun 08 2007

Hello,

The search function of our PDF Creator returns the x and y coordinates of the text object in which the text you are searching for was found.

Will this help, or do you need more or different information when searching for text?

jimtell
Posts: 6
Joined: Tue May 22 2007

Post by jimtell » Fri Jun 08 2007

It will help during the actual search. But we need to know the location of all text prior to a search. This allows the user to define a document by indicating that text "x" will be found on line "y" starting in column "z". With this information the program can determine which document definition a document belongs to by doing a search for "x" and then comparing the coordinates returned with each of the document definitions.

Bottom line: we need to be able to run a document through a process that will give us the coordinates of all the text on the page.
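The matching step described above could look roughly like the following sketch. It assumes the text has already been extracted as `(text, line, column)` tuples; the `classify` function and the shape of the `definitions` dictionary are hypothetical names chosen for the example, not part of any product.

```python
def classify(placed_words, definitions):
    """Return the name of the first definition whose anchors all match.

    placed_words: iterable of (text, line, column) tuples for one document.
    definitions:  dict mapping a definition name to a list of
                  (text, line, column) anchors that must all be present.
    """
    # Index every position at which each piece of text occurs.
    positions = {}
    for text, line, column in placed_words:
        positions.setdefault(text, set()).add((line, column))

    for name, anchors in definitions.items():
        # A definition with no anchors never matches.
        if anchors and all(
            (line, column) in positions.get(text, ())
            for text, line, column in anchors
        ):
            return name
    return None
```

A definition such as `{"invoice": [("INVOICE", 1, 30)]}` then matches any document that has "INVOICE" on line 1 starting in column 30. Exact equality is used here for simplicity; a real comparison would probably allow a small tolerance on the column.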

Joan
Amyuni Team
Posts: 2799
Joined: Wed Sep 11 2002

Post by Joan » Wed Jun 13 2007

Hello,

Using the current version of the PDF Creator, you can go over all the objects in a PDF file and retrieve each object's parameters. You can therefore loop through all the objects, check whether each object is a text object, and retrieve its coordinates.

There are also two functions that might be useful to you: GetObjectXY() returns the object located at a given coordinate, and DelimitedText() returns all the text within a specific area.
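The sketch below illustrates only the general idea behind these two kinds of lookup: a hit test at a point and a text query over a rectangle. It is not the PDF Creator API. The `TextObject` record, the functions `object_at` and `text_in_area`, and the top-left coordinate origin are all assumptions made for the example; the real signatures and units should be taken from the product documentation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextObject:
    text: str
    left: float
    top: float
    right: float
    bottom: float


def object_at(objects, x, y):
    """Hit test: return the first object whose bounding box contains (x, y)."""
    for obj in objects:
        if obj.left <= x <= obj.right and obj.top <= y <= obj.bottom:
            return obj
    return None


def text_in_area(objects, left, top, right, bottom):
    """Return the text of all objects lying fully inside the rectangle,
    read top to bottom and then left to right."""
    inside = [
        o for o in objects
        if o.left >= left and o.right <= right
        and o.top >= top and o.bottom <= bottom
    ]
    inside.sort(key=lambda o: (o.top, o.left))
    return " ".join(o.text for o in inside)
```

Looping over every text object once and storing its bounding box in a structure like this would give the "coordinates of all the text on the page" requested earlier in the thread.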
