Home
I do write a lot of stuff, both technical and non-technical, so do check them out. Here are my technical blogs, and here is my unofficial diary. Apart from that, If you want to contact me, here are my socials
Why this website ?
This page will serve as a navigator for some of my most loved and cherished projects. While the good stuff is about to come, I hope what I love doing doesn’t get unnoticed.
Should you hire me?
The short answer is yes. Here’s a big explanation on what I bring to the table :
-
High Agency : This is the most important skill for me. If I am working on a project that I like, chances are I’m going to anything to make it happen(no one ever teaches this!). I have a great blend of both research and engineering skills, so if I’m invested in a project, you can bet on me that I’m going to complete it by any means.
-
Team Player : I have been part of, and lead teams at college level in ML/AI domain. I have successfully conducted various workshops, and hosted events(details in this page). Apart from that, I’ve also been part of teams at industrial level(see next point). I know the power of teamwork, and know how to use it to my full advantage.
-
Experience : I have experience in both academia and research setting, and I’m pretty comfortable with conducting independent research(with little pushes in the right direction working best). I’ve worked in a startup to make a RAG system for financial data and I’ve also built a machine translation system(Transformer based, from scratch) for low-resourced languages as a part of a research internship - which demonstrates that I’m well versed in programming and have experience to back that up as well.
-
Thinking Fast and Slow : I’ve worked on various projects independently, both research and engineering(references in this page itself, so scroll down) and my work has been appereciated by the open source community as well(see next point). I’m someone who is great both in zero shot and chain of thought abilities, so you can bet on me.
-
Quantifiable work : I have achieved 2x Kaggle Expert, created popular Python packages and kernels( which have received appreciation from Kaggle as well). I have also made open-source projects that have been praised on HN, GitHub, Twitter (multiple instances on Twitter).
About Me 🙇♂️:
I am Yashovardhan Srivastava(quite a mouthful, so Yash is good) an undergraduate engineering student in National Institute of Technology, Warangal. From a young age, I have been fascinated by computers. As I grew older, this fascination turned into crush and crush turned into love - and from that moment, I haven’t looked back. I believe open source projects has played a significant role in that. They made me fall in love with research, development and much more. Since now I believe I am capable enough to produce some original work, I want to be a part of this beautiful journey, in which all of you play an important role.
All of projects are a result of extreme dedication, meticulousness, and hardwork. Most of them are just random thoughts that I once had, and thought-What If ? They do not need recognition, they need discussions. I might have reached a dead end with some of those - but their cycle isn’t complete. I have plenty of projects in pipeline, which I hope will be just beautiful as the ones which are already there.
Career Goals 🥅:
Wooh. That’s a tough one. There are many things that I like and I feel it is difficult to commit to something. But, there comes a time when we need commit to a field. Balance between exploration and exploitation needs to be made-and for me, that comes from working on research problems. I want to study more, to do things that make me happy.
The place where I come from, this is NOT a trend. We are hard-working, talented people-but we realise very later in life what matters to us. True satisfaction comes from happiness-and that is the purpose of life. I choose to take the road not taken.
I see myself as a research scientist/research engineer in the near future-specifically in the field of AI, an I want to highlight why this field speaks to me a lot.
Anyone who has studied Engineering(especially in India) will tell how “academically challenging” and “rigorous” the degree is - an I believe it is somewhat justified. Engineers are responsible for building the “lego blocks” on which society stands, and when you work for the greater good of society, you better be the best. I always say, Engineering is a great profession in theory, but(especially here), the adaptaion is really not good. I guess my love for engineering did not fade away even after going through this system, so I guess working on “scale up” is something I can see myself working in. It is really fascinating ince you get the hold of it.
Another interesting thing that I like is coming up with ideas and test them using the scientific method. The joy of discovering really speaks to the creative side of me, and I would really love if I can make a career in this.
Those are some of the reasons why I want to be a researcher + engineer. That is, coming up with new ideas, and testing them at various scales of production. This is something that I dream of as of now, but let’s see where life takes me.
Experience 👷 :
-
[NEW!] Project Intern, Solvendo India Private Limited: Worked with the Machine Learning Team on : a) A production LLM RAG application b) developing time series machine learning models for predict volatility of a stock(Deep Learning Based, GluonTS).
-
Research Intern, Indian Institute of Technology-Banaras Hindi University: Worked under Prof. A.K. Singh on a research project on developing a machine translation system for low resource languages such as Hindi, Bhojpuri, Magahi, Maithali.
-
[NEW!] President, Big Data Analytics and Consulting Cell(National Institute of Technolgy, Warangal) : Lead the BDACC team for the academic year 2024-2025 after being the member for 2 years in a team that has collaborated in several of the student club events such as Kaggle, Pytorch Workshop and Case study competitions, among other initiatives to develop a community of machine learning enthusiasts in NIT Warangal.
-
Executive Member, Research and Development Cell(National Institute of Technolgy, Warangal): Part of Undergraduate Research Association team of NIT Warangal which actively takes part in educating and fostering academic growth among undergraduate students.
Proficiency and Interests ⭐ :
-
Research Interests : Natural Language Processing, Mechanistic Interpretability.
-
Building in : Artificial Intelligence Research, Data Science, Natural Language Processing and Machine Learning Research.
-
Languages : Spoken(Hindi, English, Very Basic Spanish), Programming(C, C++,Python, R, Julia)
-
Frameworks: Tensorflow, Pytorch, Keras(and anything if given enough time)
Achievements 🥇
Honours
- Promoted as the President of Big Data Analytics and Consulting Cell(National Institute of Technology, Warangal) for the year 2024-2025.
-
Received Kaggle Notebook and Dataset Expert with an overall rank of 699 and 573 respectively.
- Personal Interests : Football, badminton, avid reader, philosophy connoisseur and writing.
Reseach Papers I Love 📎
In no particular order, I am listing doen some really awesome reserach papers that in one way or other have helped me think outside of the box.
-
Interpretability in the Wild - IOI Circuit Identification : A very detailed and understandable paper on circuit identification for mechanistic interpretability. Highly recommended if you want to understand how to design your own experiments.
-
The Hardware Lottery : Probably one of my favorite papers till day. The way Sara Hookor explained how AI/ML research should proceed, and how is it going till now is a real eye opener. Highly highly recommend if you want to look at the bigger picture of AI research.
-
Scaling Scaling Laws For Board Games : Andy Jones is a genius. This paper explained how we can use shorter experiments to predict outcomes of larger experiments - which are resource heavy. Highly recommend if you want learn how scale up works in real life.
-
Building Machines that Learn and Think for Themselves - Commentary on Lake, Ullman, Tenenbaum, and Gershman : Not exactly a paper, but this really forced me to think about some things. Definitely recommend this for casual reading.
-
A Two-Systems Perspective for Computational Thinking : This is one of the first papers that I read and it blew my mind. Inspired by the Kahneman’s Two Systems Approach of Thinking(Thinking Fast and Slow), this papers presents the cognitive models against which computational thinking can be analyzed and evaluated.
-
Attention is all you need : It’s everyone’s favourite research paper-and mine too. This was the paper that introduced transformers, and the rest is history. This paper taught me how to communicate your research and how to present your work.
-
Recsim-A configurable platform for recommender systems : This opened my eyes. I was in awe when I found out we can use reinforcement learning in recommendation setting. I even emailed the author of the paper thanking and asking him what he thinks whether this will be used in future recommendation systems. Google Research for the WIN.
-
Improving Low-Resource NMT through Relevance Based Linguistic Features Incorporation: This was a really well written and structured paper, which I was able to understand easily, and even used for testing in my internship project.
Projects 🧰
Here I’ll pin some of my favorite projects, more on the research 👨🔬 side(Feel free to critique me on this(and try to contribute if possible :) ) :
-
Arrakis (Looking for contributors) : Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.
-
Deeprobe (Looking for ideas) : Deeprobe is a study to understand feature importance and pattern undderstaing in Sparse Autoencoders using Monte Carlo Tree Search.
-
SAE Macaronic Languages : Understand whether language models learn words beyond language barriers, a study in Mechanistic Interpretability.
-
Secure BPE (Work in Progress) : A modified, secure version of Byte Pair Encoding algorithm.
-
Collaborative Debating (Work in Progress) : A hacky implementation of the paper “Improving Factuality and Reasoning in Language Models through Multiagent Debate”.
-
NEAT-JAX (Work in Progress) : An implementation of Neuroevolution of Augmented Topologies Algorithm in JAX which is compatible with EvoJAX.
-
Nexus Theory : Can we really trust our human-ness for the messages that we send into the cosmos? Nexus theory is a gamified version to understand machine learning interpretability using Large Language Models.
-
Elixr : Elixr an autograd library using Complex Numbers similar to Pytorch.
-
Attention Free Revolution : Developed Leviathan architectures, and alternate to Transformer architecture using a modified attention scores, taking inspiration from signal processing.
-
P-GLAm : P-GLAm is a random thought experiment on Infinite Monkey Theorem. In this, I developed a GPT-2 inspired Large Language Model which aims to test the arithmetic correctness.
Here I’ll pin some of my favorite projects, more on the development 💻 side. Feedback is always appreciated for projects like these.:
-
Dynamo : Dynamo is a Python/Rust implementation of a load balancer and autoscaler for MySQL web tier.
-
Safe SQLO : Safe SQL provides sanity checks for common DB pitfalls(so you don’t delete prod DB) ; available as a python package(CLI included)
-
Stock Tank : End to End ML pipeline to predict stock p[rices. Automate retraining, evals andd more(Github Actions). Web App available as well.
-
Gym Tunes : GymTunes is a simple AI agent that schedules a random playlist into your GCalendar based on your vibe.
-
AI GF : A small weekend project that through which youcan creaate a virtual girlfriend(not made for imitation, but for learning)
-
Synapse : Synapse is hackernews-type platform that can be used by any community as a forum. Tried making this for my college, but need more inspiration.
-
Pandora : Pandora is domain agnostic framework for case study generation and solving.
-
Verizon : A Git like version control system, from scratch, in Python, spelled out.
-
YeetCode : YeetCode is a sassy version of Python made for all GenZ people. The aim is to create a new programming language which is bussin’.
-
Blaze : Developed a RAG(Retrieval Augmentation Generation) system by using Cohere LLM and Metaphor as a part of recruitement process for Metaphor, which is made using Langchain, Chainlit and deployed on Huggingface.
-
CodeSmith : Developed a ChatGPT-inspired chatbot trained on a Python programming problems on custom created dataset, made using Langchain, and deployed on Huggingface.
-
Alzhemist : One of the first projects that got me in to the world of Attention. A Deep Learning Model to see which classifies Brain MRI on the basis of the dementia (AD). The images are classified as follows - Mildly Demented, Moderate Demented, Non Demented, Very Mild Demented.
-
Maxwell : One of my most priced possession. Maxwell is twisted take on One Shot Frequency Dominant Neighborhood Search. The scheme provided in the paper is a bit modified to generate fingerprint for an image.
-
SpiceyDicey : SpicyDicey is a end to end machine learning project that aims to predicts the number that appears on a dice. All of the work in collecting the data and editing the images has been done individually and from scratch.
Here are some of the awesome notebooks 📓 I’ve made on Kaggle(I’m a 2x Kaggle Expert also !!) :
-
FC Barcelona is Back! : Analyzed FC Barcelona’s LaLiga performance in the 2022-23 season on Kaggle, achieving Bronze Medal and 200+ views apart from receiving recognition from Kaggle.
-
BART Pretrainig from Scratch : Developed a BART model from scratch using Huggingface on Shakespere dataset in a notebook on Kaggle, which received a silver medal and 600+ views.
-
Tensorflow Recommendation System : Demonstrated on using Tensorflow Recommendation System in a Kaggle notebook that gained bronze medal, and 500+ views.
Much is yet to come, so keep an 👀