When you add documents to a collectionA collection is a container for storing and organising ingested files and documents. Only the textual content is stored in collections, not the original files and documents. the documents are ingested into ETA. ETA extracts the text from the documents and saves it as XML files (one file per document). Entities (such as people, organisations, locations and email addresses) and non-entities (such as dates and times, addresses and money) are identified and marked with coloured labels. Each unbroken annotation of marked up text is referred to as a text reference.
Depending on the settings in the ingestion configuration you selected, ETA may also make a copy of the source documents and store them in a database.
There are four ways in which you can add documents to a collection:
You can add documents to a collection at any time. You can also use the Ingestion Scheduler to automatically check for new or updated files in a specific library, or new emails from a server, and ingest them into a collection (see Scheduling ingestions).
To add documents to a collection:
If you have a large amount of data to process, put it in a server library to which ETA has access then use the Server Library tab to set up the ingestion.
Caution: During document uploads your browser will freeze and will not available for other activities. If you want to access ETA during uploads, open a second browser. For example, if you are uploading documents to ETA on Google Chrome, run another instance of ETA on Firefox until the upload is complete.
Note: You can also drag and drop files from Windows Explorer onto the Local Files tab.
The URL tab enables you to extract text based information directly from the internet in three ways:
To ingest text or files from a URL:
Note: Each time you log in to a mail server, the login details you entered (but not the password) are saved in the Login History section below the login fields. Expand this section to show the details of the five most recent logins. To populate the login fields with the details of one of the mail servers (rather than completing the fields manually), click the relevant row. Enter the password then click Login.
Note: By default, emails in subfolders will be also ingested. If you do not want subfolders to be included, untick the ‘Recursive’ checkbox beside the Submit button.
The emails are added to the Document list under the ‘Add Documents’ pane.
You can now explore and analyse the information in the collection (see Exploring and analysing information).