Hello, I'm

Harsh Kashyap

Bioinformatics Data Engineer

Data engineer, AI practitioner and full-stack developer with 4+ years building production data pipelines, LLM-powered apps and relational systems. M.Sc. Bioinformatics & Computational Biology — fluent in life-sciences data and biological databases (NCBI, UniProt, RNAcentral).

Harsh Kashyap 🧬 Genomics ⚙ Data Pipelines

What I Do

From raw sequence and operational data to dependable, query-ready resources and dashboards.

Data Engineering

Pipelines, ETL, event-driven orchestration

Generative AI & LLMs

Claude API, prompt & agentic workflows

Full-Stack Development

React / Next.js + PostgreSQL · Supabase

Bioinformatics

Genomics, RNA, NCBI / UniProt databases

Dashboards & BI

Power BI & DAX, Tableau, Looker Studio

Automation

AppSheet, Apps Script, n8n, Zapier

Key Projects

Production systems, AI workflows and bioinformatics builds — end-to-end ownership.

AI · Bioinformatics · M.Sc. Thesis

AI-Powered Assistant for Scientific Database Navigation

An LLM-integrated agentic assistant that lets researchers query biological databases (NCBI, UniProt) in natural language — with ML-based query understanding and intent classification. An early example of agentic AI workflow design.

PythonBotpressLLM IntegrationNCBI / UniProt
Full-Stack

ProSched — Production Planning App

Full-stack scheduling app with Gantt charts, weekly planning views and relational filtering for manufacturing workflows. Modular architecture, documented API design.

Next.jsTypeScriptPostgreSQL
GitHub ↗
Automation

Dispatch Email Automation Pipeline

Event-driven pipeline triggering branded HTML emails on dispatch confirmation — communication latency cut from hours to seconds with zero manual intervention.

AppSheetApps ScriptHTML
NLP / ML

Semantic Similarity Prediction Model

A BERT-based semantic similarity model with optimised text preprocessing and embedding workflows for high-accuracy contextual understanding.

PythonBERTNLP
Automation · BI

Employee KPI Automation Dashboard

End-to-end automated KPI pipeline — data ingestion, computation, scheduling and dashboard delivery — demonstrating full pipeline ownership.

Apps ScriptGoogle Sheets

Let's work together

Open to Bioinformatics Data Engineering roles — including RNA / sequence resource teams.

© Harsh Kashyap · Meerut, India · +91 73515 34994