Dmitriy Ulybin
Portfolio
Xena Exchange
● Uplift Modeling & ETL Automation for Crypto Exchange. Uplift Modeling: ● Developed a classification model (CatBoost) to predict the impact of ad campaigns on user conversions. ● Built a data processing pipeline (Python, Pandas) and applied feature engineering, improving F1-score to 0.89. ● Impact: Increased conversion rates by 22% for users receiving targeted offers (account top-ups). ETL Automation: ● Designed a data collection system aggregating information from 9 sources (REST API + asynchronous parsing via BeautifulSoup and Selenium). ● Orchestrated ETL workflows in Airflow with DAGs for hourly data updates. ● Integrated structured data into PostgreSQL and visualized analytics with Plotly. ● Impact: Reduced data retrieval time for analysts by 2–3x.
IT Solutions
● Developed ML solutions for industry clients, including: - Oil & Gas: Failure prediction for industrial equipment. - Transportation: Forecasting the month when railcars require maintenance. - Aviation: Ticket ranking model to optimize customer choices. ● Worked on end-to-end ML pipelines, from data preprocessing to model deployment. ● Grew from Intern to Full-Time Specialist.
MTS
● AI-powered Content Navigator (NLP, RecSys) for Movies, Books, Places, and Events. ● Built a RAG-based search system using Vespa vector DB and advanced embedding models, significantly enhancing retrieval quality. ● Improved search relevance through: - Intent classification for accurate query categorization and filtering. - Paraphrase generation via LLM to enhance semantic search and retrieval. - Entity recognition (NER) for precise matching of key information. ● Alignment of LLM and fine-tuning transformers to adapt language models for specific tasks and improve performance. ● Optimized geosearch to enhance search accuracy based on geographical context. ● Increased NDCG and MAP metrics, refining the relevance of the recommended content. ● Designed a microservices architecture with a FastAPI pipeline, orchestrated RAG workflows, and deployed scalable solutions via Docker.