About content import

Build 1501 on 14/Nov/2017  This topic last edited on: 5/Aug/2014, at 13:53

The ingestion workflows (upload and Back4 queues) execute the parsing of the input file passing it to our parsers modules, that are:

ParseAPDFL;ParseArchive; ParseImg; ParseMultimedia ; ParseOffice; ParseText;ParseCalendar

In the order specified in the appsettings.xml file.

Each parser will try to recognise the file, and if it feels to be the right one, it will generate an xml file containing the info that can be extracted from the file.

By default parser will extract all the available metadata, and in the workflow call we can also tell to do some extra, such as generating thumbnail, preview, video tiles, extract text etc.

The resulting xml file is then used in the workflow to fill the attributes of the gn4 object we want to create/manipulate.

The nice thing about the parsers is that they are independent from the core software.

That means that if we find a "not recognized" file format, we simply fix the specific parser, and you can update only this module in existing installations, without touching other gn4 dlls.

See also

Configuring Content Import