ZHANG LINA (Elena)

AI Engineer

I'm AI Engineer

Building end-to-end ML systems from research to deployment.

About

AI researcher focused on practical machine learning systems.

ZHANG LINA profile photo

Artificial Intelligence Researcher & Machine Learning Engineer

AI Engineer at DMSLab, Konkuk University — developing and evaluating deep learning systems for computer vision, multimodal understanding, and speech synthesis.

  • Affiliation: DMSLab, Konkuk University
  • Location: Korea
  • Degree: M.S. in AI

I work on end-to-end AI pipelines — from data preprocessing and model training to evaluation and deployment. Experience includes age & gender recognition, food detection on AI Hub datasets, multimodal model evaluation, historical figure TTS restoration, and image similarity research.

Computer VisionMultimodalTTSPyTorchMLOps

Research Highlights

Verifiable research and engineering output.

Research Interests

Primary areas of focus and ongoing exploration.

Computer Vision

Object detection, face analysis, age & gender recognition, and image similarity.

Multimodal Learning

Cross-modal evaluation, SLM surveys, and performance benchmarking.

Speech Synthesis

TTS systems, historical audio restoration, and speech emotion recognition.

Model Evaluation

Systematic experiment design, ablation studies, and deployment-ready pipelines.

Technical Skills

Tools and frameworks applied in research and production.

Core ML

PythonPyTorchTensorFlowOpenCVScikit-learnPandasNumPy

MLOps & Infrastructure

DockerUbuntuGitModel EvaluationData Pipelines

Research Areas

Computer VisionObject DetectionMultimodal LearningTTSSpeech EmotionImage Similarity

Application

ReactHTMLCSSJavaScriptREST APIs

Projects

Selected work with problem context, methods, and open-source links.

  • All
  • CV
  • Multimodal
  • Speech
  • Product
Age and Gender Recognition

Age & Gender Recognition

Problem: Robust face attribute recognition for production scenarios.
Method: PyTorch + OpenCV pipeline with optimized architecture and training strategy.
Result: Improved recognition accuracy through model refinement and data augmentation.

CVPyTorchOpenCV
GitHub
Historical Figure TTS Restoration

Historical Figure TTS Restoration

Problem: Restore historical figure voices from text input.
Method: MB-iSTFT-VITSv2 based TTS system with custom training pipeline.
Result: End-to-end speech synthesis for historical audio restoration use cases.

SpeechTTSPyTorch
GitHub
Food Detection

Food Detection

Problem: Detect and classify food items in real-world images.
Method: AI Hub dataset preprocessing, model training, and benchmark evaluation.
Result: Production-ready detection pipeline validated on AI Hub benchmarks.

CVDetectionAI Hub
Multimodal Model Evaluation

Multimodal & SLM Research

Problem: Evaluate and compare multimodal and small language models systematically.
Method: Experiment design, benchmarking framework, and curated SLM survey.
Result: Reproducible evaluation workflow and maintained reading list.

MultimodalSLMEvaluation
GitHub
Speech Emotion Recognition Survey

Speech Emotion Recognition Survey

Problem: Navigate the rapidly growing SER research landscape.
Method: Curated paper collection with categorization and reading notes.
Result: 2+ stars on GitHub; referenced SER resource for researchers.

SpeechSERSurvey
GitHub
Text Emotion Analysis

Text Emotion Analysis

Problem: Classify emotional content in text data.
Method: NLP pipeline with deep learning classifiers.
Result: Baseline emotion classification models and experiment code.

NLPEmotionPython
GitHub
LUMIÈRE Jewelry Boutique

LUMIÈRE — Jewelry E-commerce

Problem: Deliver a luxury jewelry shopping experience with catalog, checkout, and AI assistance.
Method: Next.js full-stack app with Stripe payments, GIA certificates, and AI concierge.
Result: Production MVP live on Vercel with curated collections and custom order flow.

Next.jsE-commerceAI Assistant
Live Demo

Publications & Surveys

Curated technical surveys and research reading lists.

2023Survey

Speech Emotion Recognition Paper Survey

Comprehensive collection of SER papers with categorization. 2 GitHub stars.

2023Survey

Awesome Text-to-Speech List

Curated TTS papers, models, and resources for speech synthesis research.

2025Survey

Awesome Small Language Model List

Reading list and notes on small language models and related multimodal methods.

2023Survey

Awesome Code Generation List

Curated resources for code generation and program synthesis research.

Resume

Education and professional experience. CV available upon request.

Summary

AI Engineer & ML Engineer

Building practical AI systems across CV, multimodal, and speech domains.

Education

Master of AI, Konkuk University

2019 – 2021

DMSLab, Rochester Institute of Technology collaboration, Korea

Research focus: computer vision and multimodal learning.

Bachelor of Computer Science

2012 – 2016

Changchun University, China

Computer Science and Technology.

Professional Experience

AI Research Lab

2021 – Present

Researcher

  • Age & gender recognition — improved model accuracy via architecture optimization and training pipeline refinement
  • Food detection — built preprocessing, training, and evaluation pipeline on AI Hub datasets
  • Multimodal models — designed experiments and analysis framework for performance evaluation
  • Historical figure TTS — developed MB-iSTFT-VITSv2 based speech synthesis for audio restoration
  • Image similarity — researched and developed novel models and methodologies

Open Source

Featured repositories on GitHub.

MB-iSTFT-VITSv2 based text-to-speech system for voice synthesis.

PythonTTSPyTorch

Speech Emotion Recognition paper survey — 2 stars.

SurveySpeech

Curated small language model reading list and research notes.

SLMMultimodal

TTS papers, models, and tools collection.

TTSSurvey

Face analysis and emotion-related CV experiments.

CVPython

Text-based emotion classification models and experiments.

NLPEmotion

Blog & Writing

Technical notes, tutorials, and research reading.

CSDN Blog

CV tutorials, multimodal notes, environment setup guides, and paper reading summaries.

SER Reading Notes

Speech emotion recognition paper survey and categorized reading list.

TTS Resources

Maintained list of text-to-speech papers, datasets, and open-source tools.

Contact

Open to research collaboration and ML engineering opportunities.

LinkedIn

Connect

GitHub

@zhanglina94