Skip to content

The DeepDive knowledge engine is fuelled by a unique blend of technologies 

HIW process flow
How it Works
  • Search: Our UI builds extensive search parameters

    DD_Search simplified
      • Permutations of person details, keywords and related countries executes searches across multi-language, alphabet and engine parameters. 
      • Business search returns detailed company reports and widens investigation to company hierarchies and business associates.


  • Source collation: Automated creation of a large investigation dataset

    Data extraction (NLP1)
      • Hundreds of sources are parsed to remove code, images and adverts to create clean text files.
      • Foreign language sources, compliance datasets and custom client sources  are combined to maximise the investigation horizon.  
  • NLP: Digests and analyses hundreds of search results

    Data digestion (NLP2)
      • Natural Language Processing digests sources in any language.
      • Named Entity Recognition extracts names, organisations, dates, and other structured entities from unstructured text.

       

  • Entity resolution: Removes false positives

    Entity resolution
      • Graph-based, pairwise and hierarchical clustering algorithms correlate relevant sources and isolate sources not pertaining to the search terms.
      • Resolved entities are subjected to an additional verification process using adversarial AI to ensure semantic relevance.     

  • Body of Knowledge: Source verification and statement extraction

    Body of Knowledge
      • Sources are scored  based on credibility, publication date, reputation, and consistency with other sources.
      • Prompt engineering extracts statements and assigns confidence scores for transparency on data reliability.
      • All statements are merged into a single XML database and structured according to use case.

  • Report: Pre-defined output oriented to use case

    Report construction
      • Summary of Key Findings: High-level overview of subject-related data.
      • Categorized sections: Financial insight, legal matters, business affiliations, career summary, personal background.  
      • Risk Indicators: Contextual assessment of legal, criminal, financial, regulatory risk. 
      • Associated Entities and Networks: Connection mapping between subject entities, businesses, and individuals.

       

  • Chatbot: Interrogation of body of Knowledge

    Chatbot
      • Clients can get straight to the heart of the matter without reading the whole report.
      • Provides context-aware responses using structured database statements with source citations.
      • Chatbot answers can be easily copied into clients’ own systems.

swiper preview button
Swiper Next button