Raj Purohith Arjun

           

Masters in Data Science

Texas A&M University

Department of Computer Science and Engineering

Student

Institute of Data Science




Current Project
Fine Grained Attribute Grounding

Projects

Go to my Github page to look my work and collaborate.

Prophetic Sentinel : Anticipatory Maintenance System


Crafted an anticipatory maintenance system using Python (scikit-learn, TensorFlow), SQL, and Azure leveraging machine learning to prophesy equipment failures in data centers. This proactive approach curtailed downtime and slashed maintenance expenses significantly. Garnered a striking 90% accuracy in foretelling equipment failures, leading to a notable 30% drop in maintenance costs.


AI Video summarization


Developed using Mixtral, Whisper, and AWS, integrating language models and tools like GPT-3, BERT, and FFmpeg for efficient video processing. Set up AWS EC2 instances to run large models, reduce latency, and transcribe audio to text with Whisper. Implemented dynamic quiz generation and a feedback system using Flask, HTML, and JavaScript, storing user data in a database. The project highlighted the strengths and limitations of various language models and their practical application in video summarization and interactive quizzes.


AI-driven Financial Fraud Detection Using Deep Learning and XAI


Developed an AI-based system to detect financial fraud in real-time, leveraging deep learning models like Transformers, CNNs, and GANs for fraud identification and simulation of forged transactions. Use EfficientNet for image-based fraud detection, while integrating Explainable AI (XAI) techniques to enhance model transparency and ensure ethical decision-making. Visualize fraud pa erns and risks using Tableau or Seaborn, aiding stakeholders in making informed, data-driven investment decisions. Achieved 95% accuracy in detecting fraudulent transactions, reducing false positives..


End- to-End MLOPs Pipeline for Loan Eligibility Prediction


Built MLOps pipeline for a Loan Eligibility Prediction model using Python, deployed on Google Cloud Platform (GCP). The pipeline involved creating a Flask API, containerizing it with Docker, and managing source code through Cloud Source Repository and Git.Automated deployment was handled via Cloud Build, and the model was deployed using Cloud Run. This project demonstrated efficient cloud architecture, leveraging GCP services for scalable and automated machine learning operations.


PersonaCraft: Tailored Content Recommender


Engineered a content recommendation system driven by machine learning to tailor content suggestions for individual users. This bespoke approach fostered a substantial increase in user engagement. Amplified user engagement by a commendable 25% through personalized content recommendations.


Ensemble Model for Vehicle Tracking and ANPR


Leveraging the robust capabilities of YOLOv5, YOLOv8, DeepSORT, and Easy OCR, engineered an ensemble model tailored for number plate recognition (ANPR) and vehicle tracking, particularly excelling in low light conditions. Achieving an impressive F1 Score of 0.97, this model stands as a testament to its efficacy in challenging environments. Complemented by a user-friendly web interface, it emerges as a versatile and robust solution for ANPR.




SKILLS

  • Python / SQL / R / C++
  • TensorFlow-Keras / Pytorch / spaCy / NLTK / Sci-kit
  • Seaborn / Matplotlib / Numpy / Pandas /Dplyr / OpenCV / ggplot2
  • Deep Learning / Machine learning Algorithms /Artificial Intelligence
  • Big Data / Git / Github /Apache Spark /BigML / PowerBI / YOLO
  • Data Structures/ LLM / AWS / Microsoft Azure/ Computer Vision
  • Project Management /Data Analysis/Data Pipeline/ Storytelling /Critical thinking / Problem Solving







  • Professional Experience

    Education

    • 2024-26: Masters in Data Science , Texas A&M University - GPA : 3.5/4.0
    • 2020-24: B.Tech in Computer Science Engineering with specialization
      in AI and ML , SRM University - CGPA : 9.64/10

    Internships

    Entropik Technologies (Machine Learning Engineer) - Remote, India

    • Led the development and optimization of AI-powered consumer research solutions using advanced machine learning models and techniques, driving significant improvements in customer insights and engagement.
    • Designed and implemented machine learning models to analyze and predict consumer behavior using advanced AI techniques, including Emotion AI, Behavior AI, and Predictive AI, resulting in more accurate consumer insights.
    • Utilized Python, Pandas, and NumPy to manage large-scale consumer data sets, processing millions of data points to uncover actionable insights and enhance model performance.
    • Applied supervised learning techniques, including logistic regression and support vector machines (SVM), to predict customer behavior and tailor personalized marketing strategies, improving campaign effectiveness by 25%.
    • Enhanced predictive accuracy through data preprocessing techniques such as normalization, scaling, and outlier detection, ensuring more reliable results in consumer behavior forecasting.
    • Developed and fine-tuned recommendation systems for personalized consumer experiences, leveraging collaborative filtering and content-based methods to improve user engagement and satisfaction by 20%.
    • Employed advanced data visualization tools like Matplotlib and Seaborn to present complex consumer behavior patterns and trends, helping stakeholders easily interpret data and make informed decisions.
    • Led the implementation of AI-powered insights into consumer preferences and behaviors, driving 30% more accurate product recommendations and improving marketing ROI.
    • Utilized SQL and cloud platforms (AWS, GCP) to streamline data pipelines and optimize data storage and retrieval processes, reducing data processing times by 40%.
    • Collaborated with cross-functional teams, including product managers and marketing analysts, to ensure AI models aligned with business goals and consumer expectations.
    • Delivered technical presentations to both technical and non-technical stakeholders, effectively communicating the impact of AI-driven consumer insights and providing actionable recommendations for improving consumer engagement.

    High Radius (Data Analyst) - Chennai, India

    • Built and deployed a comprehensive AI-enabled Fintech B2B cloud application, focusing on creating a scalable, full-stack web-based product.
    • Leveraged Python libraries like Pandas, NumPy, and Scikit-learn for in-depth data analysis, as well as JavaScript for interactive data visualization.
    • Conducted extensive data preprocessing and feature engineering, including data cleaning, wrangling, normalization, and scaling, to prepare datasets for predictive modeling.
    • Employed text vectorization techniques to classify user segments for targeted financial services.
    • Led the data analysis and machine learning aspects, implementing classification models such as Gradient Boosting, XGBoost, and Random Forest with extensive parameter tuning.
    • Evaluated model performance using metrics like accuracy, precision, recall, and F1-score to select the most effective model for financial risk prediction.
    • Utilized advanced anomaly detection techniques to identify payment discrepancies, increasing fraud detection capabilities by up to 90%.
    • Conducted A/B testing to analyze transaction patterns and collaborated with marketing teams, leading to targeted campaigns that improved transaction rates.
    • Executed SQL operations, including complex joins and data aggregation, to streamline the ETL process, enhancing data transformation and storage in a centralized database.
    • Set up new database schemas and optimized queries to improve data management and retrieval speed.
    • Developed automated data transformation workflows, converting raw data into formats suitable for analysis. Created an experimental framework for automated data collection and built real-time approval systems, reducing manual intervention and increasing productivity by 30%.
    • Collaborated with cross-functional teams, including UI/UX designers and backend developers, integrating machine learning insights into the application to enhance user experience.
    • Visualized data insights using tools like Tableau and R Markdown, providing detailed exploratory analysis and uncovering key business trends.
    • Identified key areas for procedural improvement through customer data analysis, providing actionable insights that enhanced decision-making and profitability.
    • Applied various clustering techniques to detect underperforming segments, leading to strategic adjustments that boosted overall system efficiency.
    • Maintained a high standard of attention to detail in handling data and building models, ensuring the accuracy and reliability of predictions. Effectively communicated insights through written reports and presentations to stakeholders, facilitating data-driven business decisions.

    Publications and Awards
    • 2024 Won Best Paper Award at International Conference on Computing Technologies for Sustainable Development-2024 for "Innovation in Vehicle Tracking : Harnessing YOLOV8 and Deep Learning Tools for Automatic Number Plate Detection" Check
    • 2023 Won Best Paper Award at the National Conference on Technology for the Society’23 for the research paper “ Enhancing ANPR using YOLOv8 and Deep Learning Techniques” held at SRMIST, Chennai Check
    • 2023 Received an Academic Award for Overall Proficiency Rank-1 in the Computer Science Department for the Year 2023 and 2024 , SRM University Check
    • 2023 Ranked in the top 10 out of 2500 participants in Proglint’s Alliance University Computer Vision Hackathon 2023.






    Certifications

    My Profile on Linkedin

  • Microsoft Certified: Azure Data Scientist Associate (certificate)
  • Hackerank Certified: SQL Advance Programmer (Verify)
  • Python for Data Science, AI & Development IBM (certificate)
  • Application of Machine Learning in Urban Studies (IIRS &ISRO) (certificate)
  • Neural Networks and Deep Learning by Andrew Ng (verify)
  • Introduction to Machine Learning by Debjani Chakraborty (NPTEL, IIT KGP) (verify)
  • Natural Language Processing by Haimanti Banerji (NPTEL, IIT KGP) (verify)



  • Contact


    Mailing address: Unit : 204 , The Villas of Cherry Hollow, 503 Cherry Street, College station, TX 77840

    E-mail: raj2001@tamu.edu or rajpurohitharjun58@gmail.com
    Linkdedin:  Reach me at Linkedin