CIGNEX and Relevance Lab have joined forces. Learn more at Relevance Lab

Case Study

Quick PoC to demonstrate Document Classification

Solutions	Industry	Technology	Expertise Delivered
AI & ML	Legal	Java: TF-IDF+sdm	Development

Client Overview

The client has over 15,000 employees and is active across 150 countries, offering expertise in Health, Tax and Accounting, Governance, Risk and Compliance.

Business Need

The client's process for classifying 10-K forms was manual, error-prone and not scalable. With 10M documents and 36 target categories, they wanted an intelligent classification model.

Key Features

Input: XML files / Output: text files, produced by a parser built on Apache Tika plus custom code
Evaluated DL4J, Naïve Bayes and TensorFlow as classifiers that run the models against the test set
Reviewer: shows the classifier's results, including audit logs and the documents parsed, and supports reviewing outliers
Custom Reviewer captures the results of each classification iteration in terms of accuracy
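
To make the classification step above more concrete, here is a minimal sketch of a multinomial Naïve Bayes text classifier in Java. It is not the implementation used in the PoC: it works on raw word counts with add-one smoothing rather than the TF-IDF weights named in the technology row, and the class name `NaiveBayesSketch`, its methods, and the labels `tax` and `risk` are all hypothetical, chosen only for illustration.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

/** Hypothetical sketch: multinomial Naive Bayes over raw word counts (not TF-IDF). */
class NaiveBayesSketch {
    private final Map<String, Integer> docsPerLabel = new HashMap<>();
    private final Map<String, Map<String, Integer>> wordCounts = new HashMap<>();
    private final Map<String, Integer> totalWords = new HashMap<>();
    private final Set<String> vocabulary = new HashSet<>();
    private int totalDocs = 0;

    /** Lowercases the text and splits on anything that is not a letter or digit. */
    private static String[] tokenize(String text) {
        return text.toLowerCase().split("[^a-z0-9]+");
    }

    /** Adds one labelled document to the per-label word counts. */
    void train(String label, String text) {
        totalDocs++;
        docsPerLabel.merge(label, 1, Integer::sum);
        Map<String, Integer> counts = wordCounts.computeIfAbsent(label, k -> new HashMap<>());
        for (String token : tokenize(text)) {
            if (token.isEmpty()) continue;
            counts.merge(token, 1, Integer::sum);
            totalWords.merge(label, 1, Integer::sum);
            vocabulary.add(token);
        }
    }

    /** Returns the label with the highest log-probability, or null if nothing was trained. */
    String predict(String text) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (String label : docsPerLabel.keySet()) {
            // Log prior: share of training documents carrying this label.
            double score = Math.log((double) docsPerLabel.get(label) / totalDocs);
            Map<String, Integer> counts = wordCounts.get(label);
            // Add-one smoothing: every vocabulary word gets one extra pseudo-count.
            int denominator = totalWords.getOrDefault(label, 0) + vocabulary.size();
            for (String token : tokenize(text)) {
                if (token.isEmpty()) continue;
                score += Math.log((counts.getOrDefault(token, 0) + 1.0) / denominator);
            }
            if (score > bestScore) {
                bestScore = score;
                best = label;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        NaiveBayesSketch nb = new NaiveBayesSketch();
        nb.train("tax", "income tax deduction filing");
        nb.train("risk", "market risk exposure hedging");
        System.out.println(nb.predict("tax filing"));
    }
}
```

Log-probabilities are summed instead of multiplying raw probabilities so that long documents do not underflow to zero. A production pipeline would feed this step with the text extracted by the parser and would likely rely on an established library rather than hand-rolled counts.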

Results

An additional ~1800 documents were included in the PoC, on top of the original ~2000, to validate the accuracy.
Naïve Bayes provided the highest level of accuracy (~95%). Accuracy can be further enhanced by including an external feature set.
Scalability can be achieved by using or building on Big Data frameworks for distributed computing.
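
An accuracy figure like the one reported above is the share of test documents whose predicted category matches the expected one. The sketch below shows one way to compute it in Java, along with a per-category breakdown that helps spot weak categories. The class `AccuracySketch` and its method names are hypothetical and are not taken from the PoC's Reviewer component.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical sketch: overall and per-category accuracy for a labelled test set. */
class AccuracySketch {

    /** Fraction of predictions that match the expected labels. */
    static double accuracy(List<String> expected, List<String> predicted) {
        if (expected.size() != predicted.size() || expected.isEmpty()) {
            throw new IllegalArgumentException("lists must be non-empty and of equal length");
        }
        int correct = 0;
        for (int i = 0; i < expected.size(); i++) {
            if (expected.get(i).equals(predicted.get(i))) correct++;
        }
        return (double) correct / expected.size();
    }

    /** For each expected category, the share of its documents classified correctly (recall). */
    static Map<String, Double> perCategory(List<String> expected, List<String> predicted) {
        if (expected.size() != predicted.size()) {
            throw new IllegalArgumentException("lists must be of equal length");
        }
        Map<String, int[]> tallies = new TreeMap<>(); // label -> {correct, total}
        for (int i = 0; i < expected.size(); i++) {
            int[] tally = tallies.computeIfAbsent(expected.get(i), k -> new int[2]);
            if (expected.get(i).equals(predicted.get(i))) tally[0]++;
            tally[1]++;
        }
        Map<String, Double> result = new TreeMap<>();
        tallies.forEach((label, tally) -> result.put(label, (double) tally[0] / tally[1]));
        return result;
    }
}
```

With 36 target categories, a single overall number can hide a category that is almost always misclassified, which is why the per-category view is worth logging on each iteration.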

CIGNEX is a global consulting company offering solutions, services and platforms on Open Source, Cloud and Automation technologies. Since 2000, CIGNEX has been delivering enterprise class solutions, which are built using leading platforms & can easily be integrated with existing systems to achieve unparalleled results. By leveraging multiple delivery models, we help organizations around the world to increase revenue, achieve business goals, gain competitive advantage, and maximize customer satisfaction while significantly reducing the cost of doing business.

As a leading system integrator, CIGNEX provides end-to-end services on Liferay | UiPath | Kafka – Confluent | Sitecore | Red Hat | Appian | Salesforce | ServiceNow | MongoDB | Drupal

For questions, RFPs or to get in touch, email us at info@cignex.com
