Data Scientist
Adhoc SCF
Built an AI-powered automation system to modernize client portfolio analysis and reduce manual effort in extracting, validating, and reporting investment data.
- Processed PDFs, Excel files, scanned documents, and images using OCR with Google Document AI and Gemini for document classification and structured data extraction.
- Delivered asset categorization, ISIN matching, web enrichment, Key Information Document (KID) analysis, and cost extraction across heterogeneous sources.
- Contributed to database design for asset storage and historical tracking, ensuring data traceability and GDPR-compliant processing.
- Generated professional PDF/Excel benchmark reports with “Old vs New” validation; built a prototype application and API access layer for the full ingestion-to-reporting workflow.
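
Illustrative sketch (not the project's actual code): ISIN matching of the kind listed above usually starts with a format and check-digit test before any lookup. The function name `is_valid_isin` is hypothetical. It assumes the standard ISIN layout (2-letter prefix, 9 alphanumeric characters, 1 check digit) and the Luhn checksum applied after mapping letters to numbers (A=10 ... Z=35).

```python
import re

# Assumed layout: 2 letters, 9 alphanumerics, 1 numeric check digit.
_ISIN_RE = re.compile(r"[A-Z]{2}[A-Z0-9]{9}[0-9]")


def is_valid_isin(code: str) -> bool:
    """Return True if `code` has ISIN shape and a correct check digit."""
    code = code.strip().upper()
    if not _ISIN_RE.fullmatch(code):
        return False

    # Map each character to its base-36 value (0-9 stay, A=10 ... Z=35)
    # and concatenate the decimal representations into one digit string.
    digits = "".join(str(int(ch, 36)) for ch in code)

    # Luhn checksum: walking from the right, double every second digit
    # and subtract 9 from any doubled value above 9.
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


if __name__ == "__main__":
    for candidate in ("US0378331005", "US0378331004", "not-an-isin"):
        print(candidate, is_valid_isin(candidate))
```

A check like this only filters out malformed or mistyped identifiers extracted by OCR; confirming that an ISIN refers to a real instrument would still require a lookup against a reference data source.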