Spatial Text Service
Overview
The ai-core-spatial-text service uses the cognaize_spatial_text Python library to
extract text from PDF documents with Optical Character Recognition (OCR) while
preserving each document's spatial layout. The service uses aiohttp to handle HTTP and
WebSocket connections. For more details about cognaize_spatial_text, refer to
the Spatial Text Package documentation.
Installation
Install the required dependencies with pip, supplying the FURY_AUTH authentication token:
FURY_AUTH=${FURY_AUTH} pip install -r requirements.txt
Running the Service Locally
Launching the Server:
python app/server.py
Monitoring Service Progress via WebSocket [Optional]
- Set the request type to WebSocket
- Enter the WebSocket URL: ws://localhost:8000/ws
- Click Connect
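The same connection can be opened from a script instead of Postman. The following is a minimal sketch, not the service's own client: it assumes the aiohttp package is installed on the client side and that progress updates arrive as text frames, which this README does not specify. The function name watch_progress is hypothetical.

```python
import asyncio

# WebSocket URL taken from the steps above.
WS_URL = "ws://localhost:8000/ws"


async def watch_progress(url: str = WS_URL) -> None:
    """Print every text message the service sends until the socket closes."""
    # Third-party dependency, imported lazily so this module loads without it.
    import aiohttp

    async with aiohttp.ClientSession() as session:
        async with session.ws_connect(url) as ws:
            async for msg in ws:
                if msg.type == aiohttp.WSMsgType.TEXT:
                    # Assumption: updates are plain text (or JSON sent as text).
                    print(msg.data)
                elif msg.type == aiohttp.WSMsgType.ERROR:
                    break
```

With the server running, calling asyncio.run(watch_progress()) keeps printing updates until the server closes the connection.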
Uploading a PDF File for Processing:
- Open Postman
- Set the request type to HTTP
- Set the method to POST
- Use the URL http://localhost:8000/txt
- Add a key with the type File and name it file
- Select the PDF file from your computer that you wish to upload
- Click Send to upload the file and start the Spatial Text Creation process
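The upload steps above can also be scripted. The sketch below uses only the Python standard library and assumes what the Postman steps imply: a multipart/form-data POST to http://localhost:8000/txt with a single file field named file. The helper names build_multipart and upload_pdf are hypothetical and not part of the service.

```python
import json
import urllib.request
import uuid
from pathlib import Path
from typing import Tuple

# Endpoint and field name taken from the Postman steps above.
UPLOAD_URL = "http://localhost:8000/txt"
FILE_FIELD = "file"


def build_multipart(field: str, filename: str, payload: bytes) -> Tuple[bytes, str]:
    """Encode one file as a multipart/form-data body.

    Returns the encoded body and the matching Content-Type header value.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        "Content-Type: application/pdf\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + payload + tail, f"multipart/form-data; boundary={boundary}"


def upload_pdf(path: str, url: str = UPLOAD_URL) -> dict:
    """POST the PDF at `path` to the service and return the decoded JSON reply."""
    pdf = Path(path)
    body, content_type = build_multipart(FILE_FIELD, pdf.name, pdf.read_bytes())
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running locally, upload_pdf("sample.pdf") would return the parsed response as a dictionary; sample.pdf is a placeholder file name.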
Viewing the Output:
- Response Format: The response will appear in JSON format as follows:
{
"result": "spatial text content",
"filename": "uploaded_file.pdf"
}
- WebSocket Updates: If connected via WebSocket, you'll receive real-time updates on the progress of Spatial Text creation.
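A client consuming the JSON response might validate it before use. This is a small sketch that assumes the response carries exactly the two keys shown above, result and filename; the function name parse_response is hypothetical.

```python
import json
from typing import Tuple


def parse_response(raw: str) -> Tuple[str, str]:
    """Return (filename, spatial_text) from the service's JSON response body."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    # Keys taken from the response example above.
    missing = {"result", "filename"} - data.keys()
    if missing:
        raise ValueError(f"response is missing keys: {sorted(missing)}")
    return data["filename"], data["result"]
```

Because the spatial text encodes layout through whitespace, printing or saving the result string unchanged, without stripping or re-wrapping it, keeps that layout intact.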