Home
I’m Yashovardhan Srivastava, a recent engineering undergraduate(2025) with a strong passion for building, researching and sharing knowledge. I write blogs extensively, both technical and non-technical, so do check them out. Here are my technical blogs, and here are my non technical blogs.
Beyond that, If you want to contact me, reach out to me at ysrivastava82(at)gmail(dot)com. Connect with me on my socials
Should you hire me?
Read the small blurb below about me. Here’s my resume if you prefer it that way:
-
Notable work : I am a 2x Kaggle Expert, created popular Python packages and kernels( which have received appreciation from Kaggle as well). I have 70+ public of open-source projects, some of which have been praised on HN, GitHub, and on multiple instances on Twitter.
-
High Agency : I am obsessed with programming, so much so that I forget everything when I’m in the zone, and agency is the thing that keeps me in it. I take ownership of everything I work on, and can go great lengths to make it happen. I don’t rely on talent - I rely on obsession.
-
Experience : I’ve worked with 2 D2C(AI and AI+Finance) startups, 1 B2B startup(Finance), and once with a leading research instituion of India, and over 70 public open source projects. I’m comfortable in programming and never shy away from learning things I’m unaware of.
If you’d rather skip the details, here’s my resume. Please feel free to reach out to me at ysrivastava82(at)gmail(dot)com or over my socials if you feel I would be good fit for your organization. I would love to connect and chat more!
Projects 🧰
Here I’ll pin some of my favorite projects, more on the research 👨🔬 side(Feel free to critique me on this(and try to contribute if possible :) ) :
-
Cadence : Cadence implements an evolutionary system that uses LLMs to iteratively generate, mutate, and improve programs for solving computational problems such as the Traveling Salesman Problem (TSP).
-
REALBT (Looking for contributors) : REALBT is a simple, effective backtesting engine written in pure python.
-
Arrakis (Looking for contributors) : Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments. 28+ stars on Github ; 250+ monthly PyPi downloads.
-
Deeprobe (Looking for ideas) : Deeprobe is a study to understand feature importance and pattern undderstaing in Sparse Autoencoders using Monte Carlo Tree Search.
-
SAE Macaronic Languages : Understand whether language models learn words beyond language barriers, a study in Mechanistic Interpretability.
-
Secure BPE (Work in Progress) : A modified, secure version of Byte Pair Encoding algorithm.
-
Collaborative Debating (Work in Progress) : A hacky implementation of the paper “Improving Factuality and Reasoning in Language Models through Multiagent Debate”.
-
NEAT-JAX (Working on PR) : An implementation of Neuroevolution of Augmented Topologies Algorithm in JAX which is compatible with EvoJAX. 14+ Github stars ; Multiple PRs
-
Nexus Theory : Can we really trust our human-ness for the messages that we send into the cosmos? Nexus theory is a gamified version to understand machine learning interpretability using Large Language Models.
-
Elixr : Elixr an autograd library using Complex Numbers similar to Pytorch. 3+ Github stars ; Multiple PRs
-
Attention Free Revolution : Developed Leviathan architectures, and alternate to Transformer architecture using a modified attention scores, taking inspiration from signal processing. 7+ Github stars
-
P-GLAm : P-GLAm is a random thought experiment on Infinite Monkey Theorem. In this, I developed a GPT-2 inspired Large Language Model which aims to test the arithmetic correctness.
Here I’ll pin some of my favorite projects, more on the development 💻 side. Feedback is always appreciated for projects like these.:
-
PySlides: PySlides is terminal based application that converts markdown into slides that can be presented from the terminal.
-
Devsidian(Lovable Project) : Log your developement journey using Devsidian. Made using Lovable for personal use.
-
Snappyr : Setup Python Projects Blazingly Fast, and work on things that matter. No External Dependencies.
-
Dynamo : Dynamo is a Python/Rust implementation of a load balancer and autoscaler for MySQL web tier.
-
Safe SQL : Safe SQL provides sanity checks for common DB pitfalls(so you don’t delete prod DB) ; available as a python package(CLI included). 300+ monthly PyPI Downloads.
-
Stock Tank : End to End ML pipeline to predict stock prices(upto 30 days). Automate retraining, evals and more(Github Actions). Streamlit Web App available as well.
-
Gym Tunes : GymTunes is a simple AI agent that schedules a random playlist into your GCalendar based on your vibe.
-
AI GF : A small weekend project that through which you can create a virtual girlfriend(not made for imitation, but for learning)
-
Synapse : Synapse is hackernews-type platform that can be used by any community as a forum. Tried making this for my college, but need more inspiration.
-
Pandora : Pandora is domain agnostic framework for case study generation and solving.
-
Verizon : A Git like version control system, from scratch, in Python, spelled out.
-
YeetCode : YeetCode is a sassy version of Python made for all GenZ people. The aim is to create a new programming language which is bussin’.
-
Blaze : Developed a RAG(Retrieval Augmentation Generation) system by using Cohere LLM and Metaphor as a part of recruitement process for Metaphor, which is made using Langchain, Chainlit and deployed on Huggingface. 8+ Github stars.
-
CodeSmith : Developed a ChatGPT-inspired chatbot trained on a Python programming problems on custom created dataset, made using Langchain, and deployed on Huggingface.
-
Alzhemist : One of the first projects that got me in to the world of Attention. A Deep Learning Model to see which classifies Brain MRI on the basis of the dementia (AD). The images are classified as follows - Mildly Demented, Moderate Demented, Non Demented, Very Mild Demented.
-
Maxwell : One of my most priced possession. Maxwell is twisted take on One Shot Frequency Dominant Neighborhood Search. The scheme provided in the paper is a bit modified to generate fingerprint for an image.
-
SpiceyDicey : SpicyDicey is a end to end machine learning project that aims to predicts the number that appears on a dice. All of the work in collecting the data and editing the images has been done individually and from scratch.
Here are some of the awesome notebooks 📓 I’ve made on Kaggle(I’m a 2x Kaggle Expert also !!) :
-
FC Barcelona is Back! : Analyzed FC Barcelona’s LaLiga performance in the 2022-23 season on Kaggle, achieving Bronze Medal and 200+ views apart from receiving recognition from Kaggle.
-
BART Pretrainig from Scratch : Developed a BART model from scratch using Huggingface on Shakespeare dataset in a notebook on Kaggle, which received a silver medal and 600+ views.
-
Tensorflow Recommendation System : Demonstrated on using Tensorflow Recommendation System in a Kaggle notebook that gained bronze medal, and 500+ views.
Experience 👷 :
-
AI Engineer Intern, TurboML: Shipped end to end self hosted video generation(700K+ impressions), image generation/editing(2.7M+ impressions), and customized meme generation to Whatsapp, utilizing state of the art diffusion models. Facilitated resolution of STT server memory outage, leading to 63% decrease in memory usage
-
DevOps Engineer, Strykr.ai: Primary Backend Engineer of Strykr.ai, where I worked on implementing request caching, response streaming, and async API call which reduced latency by 6 seconds and facilitated deployment migration of the said application from Render to Railway.
-
Project Intern, Solvendo India Private Limited: Worked with the Machine Learning Team on : a) A production LLM RAG application b) developing time series machine learning models for predict volatility of a stock(Deep Learning Based, GluonTS).
-
Research Intern, Indian Institute of Technology-Banaras Hindi University: Worked under Prof. A.K. Singh on a research project on developing a machine translation system for low resource languages such as Hindi, Bhojpuri, Magahi, Maithali.
-
President, Big Data Analytics and Consulting Cell(National Institute of Technolgy, Warangal) : Lead the BDACC team for the academic year 2024-2025 after being the member for 2 years in a team that has collaborated in several of the student club events such as Kaggle, Pytorch Workshop and Case study competitions, among other initiatives to develop a community of machine learning enthusiasts in NIT Warangal.
-
Executive Member, Research and Development Cell(National Institute of Technolgy, Warangal): Part of Undergraduate Research Association team of NIT Warangal which actively takes part in educating and fostering academic growth among undergraduate students.
About Me 🙇♂️:
I am Yashovardhan Srivastava(quite a mouthful, so Yash is good) a passed out undergraduate engineering student from National Institute of Technology, Warangal(2025). From a young age, I have been fascinated by computers. As I grew older, this fascination turned into crush and crush turned into love - and from that moment, I haven’t looked back. I am obsessed with programming. My obsesison has made me someone who likes to build a lot - and that has had a huge ROI for me. I believe I am capable enough to produce some original work, I try to do so from time to time, producing projects that help me deepen my understanding of something I’m interested in. That, for me is the definition of luxury - working on things you like.
All of projects are a result of extreme dedication, meticulousness, and hardwork. Most of them are just random thoughts that I once had, and I thought echoing in my brain that - I can build that. The journey from there to actually making something worth checking out has made me a somewhat decent programmer.
Career Goals 🥅:
I try to practice the luxury I talked about - I want to study things that fascinate me, and build things that make me happy.
The place where I come from, this is NOT a trend. We are hard-working, talented people-but we realise very later in life what matters to us. True satisfaction comes from happiness-and that is the purpose of life.
Anyone who has studied Engineering(especially in India) will tell how “academically challenging” and “rigorous” the degree is - an I believe it is somewhat justified. Engineers are responsible for building the “lego blocks” on which society stands, and when you work for the greater good of society, you better be the best. I always say, Engineering is a great profession in theory, but(especially here, in India), really bad in practice. I guess my love for engineering did not fade away even after going through this system, so doing this art of “scale up” is something I can see myself working in.
This is the reason why I’m a researcher in free time, and an engineer full time. That is, I come up with new ideas, and try testing them at various scales.
What do I look for in a workplace?
I’ve been workin in startups from some time, and after giving countless interviews and havin chats with founders, here’s my few requirements that I look for in a workplace:
-
Innovation over bureaucracy: In most organization, great ideas get tossed into the bin due to bureaucracy. I feel in a good org, ideas flow freely, and innovation is valued more highly - regardless of your position.
-
Mission over projects: An org with a clear mission works in a calculated fashion than an org that hops from project to project. Engineers thrive in environments where there contributions serve a greater mission.
-
Teamwork over individuality: A talented team from which I can learn is the best, and is something I look for positively. I am by no means expert, and with a talented team I try to learn how to effectively communicate, solve problems and basically manage tasks that are given to me.
Other than that, basic requirements such as good and timely pay, flexibility in working hours, and a sense of respect from the organization would be ideal.
Proficiency and Interests ⭐ :
- Research Interests : Natural Language Processing, AI.
- Languages : Programming(C, C++,Python, R, Julia)
- Frameworks: Tensorflow, Pytorch, Keras(and anything if given enough time)
Achievements 🥇
Honours
- Promoted as the President of Big Data Analytics and Consulting Cell(National Institute of Technology, Warangal) for the year 2024-2025.
-
Received Kaggle Notebook and Dataset Expert with an overall rank of 699 and 573 respectively.
- Personal Interests : Football, badminton, avid reader, philosophy connoisseur and writing.
Research Papers I Love 📎
In no particular order, I am listing some really awesome research papers that in one way or other have helped me think outside of the box.
-
Interpretability in the Wild - IOI Circuit Identification : A very detailed and understandable paper on circuit identification for mechanistic interpretability. Highly recommended if you want to understand how to design your own experiments.
-
The Hardware Lottery : Probably one of my favorite papers till day. The way Sara Hookor explained how AI/ML research should proceed, and how is it going till now is a real eye opener. Highly highly recommend if you want to look at the bigger picture of AI research.
-
Scaling Scaling Laws For Board Games : Andy Jones is a genius. This paper explained how we can use shorter experiments to predict outcomes of larger experiments - which are resource heavy. Highly recommend if you want learn how scale up works in real life.
-
Building Machines that Learn and Think for Themselves - Commentary on Lake, Ullman, Tenenbaum, and Gershman : Not exactly a paper, but this really forced me to think about some things. Definitely recommend this for casual reading.
-
A Two-Systems Perspective for Computational Thinking : This is one of the first papers that I read and it blew my mind. Inspired by the Kahneman’s Two Systems Approach of Thinking(Thinking Fast and Slow), this papers presents the cognitive models against which computational thinking can be analyzed and evaluated.
-
Attention is all you need : It’s everyone’s favourite research paper-and mine too. This was the paper that introduced transformers, and the rest is history. This paper taught me how to communicate your research and how to present your work.
-
Recsim-A configurable platform for recommender systems : This opened my eyes. I was in awe when I found out we can use reinforcement learning in recommendation setting. I even emailed the author of the paper thanking and asking him what he thinks whether this will be used in future recommendation systems. Google Research for the WIN.
-
Improving Low-Resource NMT through Relevance Based Linguistic Features Incorporation: This was a really well written and structured paper, which I was able to understand easily, and even used for testing in my internship project.