dinarazhorabek.github.io / resume
About

Data engineer and analyst with 3 years of experience building production pipelines, AI-powered platforms, and data governance systems across finance, healthcare, and labor markets. I work across the full data stack, from ETL and schema design to LLM integration and stakeholder-facing dashboards. Currently finishing my MS in Applied Business Analytics at Boston University, graduating May 2026.

Education
Boston University Metropolitan College
M.S. Applied Business Analytics  ·  Boston, MA
GPA 3.81/4.0  ·  Data Mining · Data Science · Marketing Analytics · Enterprise Risk Analytics · Cloud Analytics
Kazakh-British Technical University
B.S. Information and Communication Technology  ·  Almaty, Kazakhstan
GPA 3.57/4.0  ·  Software Engineering · Algorithms · Object-Oriented Programming · Enterprise Architecture
Experience
Boston University
Graduate Research Assistant  ·  Boston, MA
  • Transformed disconnected labor market data sources into a single scalable pipeline processing 3M+ data points in Python and SQL, delivering warehouse-ready datasets with automated quality monitoring and anomaly detection that enabled reproducible research outputs for cross-functional stakeholders.
  • Designed and deployed KPI dashboards translating complex analytical findings into clear, actionable outputs for cross-functional stakeholders.
  • Improved advising outcomes by building and deploying a student advising platform, managing stakeholder onboarding, collecting user feedback, and benchmarking performance against traditional methods.
Python SQL Django Plotly Azure LangChain
The Build Fellowship
Data Science Build Student Consultant  ·  New York City, NY
  • Identified statistically significant disparities in patient demographics and treatment outcomes across a 65,000+ patient diabetes dataset, producing evidence-based recommendations that informed early-stage clinical program design.
Excel SPSS Statistical Analysis
VTB Bank Kazakhstan
Full-Stack Software Engineer in Financial Systems  ·  Almaty, Kazakhstan
  • Closed gaps in financial risk visibility by designing and deploying risk monitoring dashboards in Oracle APEX, enabling real-time identification of suspicious client activity and anomalous transactions, and surfacing risk metrics that directly supported data governance and regulatory reporting requirements for compliance and operations leadership.
  • Resolved critical reporting latency issues slowing compliance and investment operations by optimizing large-scale relational queries and PL/SQL logic using DBMS Profiler, achieving roughly 50% performance improvement and significantly improving data accuracy and delivery speed for business stakeholders.
  • Designed and maintained data mapping logic and automated workflows in PostgreSQL and Oracle, ensuring integrity and consistency of financial data across engineering and product teams in line with internal governance standards.
  • Cut feature rollout time by 30% by developing high-throughput data pipelines, streamlining data delivery and reducing errors in downstream financial reporting.
PostgreSQL Oracle PL/SQL Oracle APEX React Node.js JavaScript Git CI/CD
Projects
Django · Python · LangChain · LLMs · Azure
Built an AI-driven analytics system over 3M+ labor market records, enabling natural language querying of job trends and automated career insights.
Research paper accepted (forthcoming): SET III Symposium on Entrepreneurship & Technology, 2026.
Python · Plotly · Streamlit
Won 1st place at BU MET Hackathon 2025 by building a sentiment-driven trading strategy with automated backtesting and an interactive portfolio performance dashboard.
Predicted dispute outcomes and automated intent detection across thousands of consumer complaints using a Naive Bayes NLP model in R, achieving 74.45% accuracy.
Python · Scikit-learn
Built ML-based trading strategies on Boeing and S&P 500 data, generating $294 profit from $100 in backtesting.
Python · NLP · ML · Quarto
Integrated Lightcast job-posting data with FRED macroeconomic indicators to analyze 2024 U.S. labor market trends, applying NLP and statistical modeling to evaluate hiring patterns, skill demand, and regional wage dynamics.
Classified stellar variability across 100K+ stars using kNN, Logistic Regression, SVM, and Random Forest models, with Gaussian SVM achieving 71% test accuracy.
Skills
Languages
Python · R · SQL · Java · JavaScript
ML & AI
LangChain · LLMs · Prompt Engineering · Scikit-learn · PySpark · TensorFlow · Transformers · Naive Bayes · Random Forest · Decision Trees · k-means · NLP
Data Engineering
ETL/ELT Pipelines · Data Modeling · Schema Design · Data Governance · Data Quality · Anomaly Detection · PostgreSQL · Oracle · PL/SQL · Snowflake · BigQuery · MongoDB · API Integration
Cloud & DevOps
AWS · Azure · Docker · CI/CD · Git · REST APIs
Visualization
Plotly · Streamlit · Tableau · Power BI · Matplotlib · Django · SPSS · A/B Testing
Web
React · Node.js · Oracle APEX
Additional
Publications Zhorabek et al. (2026). "Career Compass Labor Analytics: Visualization Architecture for Career Planning." SET III Symposium on Entrepreneurship & Technology (forthcoming).
Awards Presidential "Bolashak" Scholarship (Kazakhstan) · BU MET Hackathon Winner (2025)
Certifications AWS Academy Graduate, Cloud Foundations (Nov 2025) · Data Analyst (DataCamp) · Back End Development and APIs (freeCodeCamp)