liberTECHS

  • About
  • Contact
  • Portfolio
  • Home

Knowledge Extraction With Spark

Goal

Derive meaningful insights and knowledge from large volumes of text data.

Role

I developed a python and spark based tools to recognise, process and summarise untructured text data.

Capability

Highly customisable toolkit for rapid and high quality exploration of text patterns:

  • Profile unstructured text data by recognizing and quantifying either all text strings or user-defined ones.
  • Capture context variations around text patterns of interest
  • Conduct semantic grouping at scale to enhance clarity regarding patterns in text and improve the accuracy of conclusions.
  • Produce analytics-ready summarized data.

Skills

Regex, Spark, Python, Big Data

FROM:

Polluted + unnecessarily dilluted view of text patterns

TO:

Contextual focus + meaningful relative scale

Year

2022

Filed Under: Data Engineering & BI

CONTACT

To get in touch, please fill out a contact form or give me a call at +44 7854 878 140.

 

SOCIALIZE

  • LinkedIn

JUMP TO

About
Portfolio
Home

© Copyright 2020 · LIBERTECHS · All Rights Reserved

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.