Safeguarding Customer Data
Processing hundreds of thousands of loan applications annually and safeguarding vast amounts of sensitive financial data is no small task—especially with 20,000-plus employees operating in a highly regulated financial services environment. To achieve 100% GLBA compliance, our customer needed to de-identify customer data to protect against unauthorized access, destruction and loss.
A high-stakes, time-sensitive endeavor—ensuring business continuity and avoiding regulatory penalties.
Partners in Comprehensive GLBA Data Anonymizer Solution
Experts in AI/ML and Data Analytics
To meet GLBA compliance, our customer recognized that an AWS-based data anonymization solution could be the answer. For a Premier Tier Services Partner capable of providing scalable, secure solutions, they turned to TEKsystems Global Services (TGS).
Our experts provided not only technical excellence but also strategic execution, which was essential, given the project’s massive scope: 1.3 petabytes and 13.5 billion objects of complex, semi-structured and unstructured files. In a collaborative approach, our customer set the project direction and prioritized the backlog, while our team brought deep AWS, AI / machine learning (ML) and data analytics expertise to provide:
- Solution architecture advisement
- MI / data engineering
- Project management and reporting
To accelerate delivery and performance, TGS integrated agile data engineering capacity and organized the solution into three parts:
- Raw data classification: examined loan business identifiers and PII columns in structured and semi-structured data using Bedrock LLM
- Semi-structured loan master inventory generation program: used Bedrock LLM to identify and store loan numbers with corresponding locations
- Semi-structured de-identification program: merged qualified data sets with retention data to identify and anonymize specific loans
- Technology stack: leveraging cutting-edge tools to drive intelligent automation and scalable data processing, including AWS Bedrock LLM, Lambda, Glue, Step Functions, Athena and Python/PySpark
- Architecture: a codeveloped reference architecture designed in partnership with the customer to ensure alignment with business and compliance goals
- Delivery model: agile and outcome-driven—direction set by customer and execution led by TGS with precision and flexibility
Interested in a similar solution?
Real-World Results
By implementing this solution, our customer achieved full GLBA compliance for 1.3 petabytes of raw data while also establishing a scalable framework capable of handling massive data volumes with minimal operational overhead.
At a Glance
- 15TB of data processing velocity per day—a 75x improvement over the previous manual process
- Scalable anonymization across 13.5B data objects
- 97.5% accuracy in PII/PIFI detection, exceeding the 95% target
Through close collaboration, we ensured compliance and strengthened the customer’s data security posture, enabling safer data consumption and laying the foundation for future AI/ML innovation.