Matteo Villosio

AI Lead and Trail Runner

Tinexta

GlobalAI Association

StAI Tuned

Biography

Matteo Villosio is AI Lead at Tinexta Group, where he conceived and launched LextelAI, now Italy’s leading AI assistant for lawyers and legal professionals, and is currently advancing large‑language‑model and agent‑based solutions across the group’s businesses.

In parallel, he co‑founded DatAIMed and drives its AI vision, orchestrating autonomous‑agent pipelines and a multi‑collection MongoDB vector database that indexes more than 150 million scientific papers to deliver real‑time, bias‑checked clinical insights. In this role he recruits and mentors high‑performance AI teams, forges collaborations with hospitals, CROs and universities, and aligns product strategy with clinical and market needs.

Earlier, as the first Data Scientist at Greenomy, Matteo built the firm’s inaugural deep‑NLP system and won second place at the Swift Hackathon. He has designed machine‑learning solutions for audit analytics at Generali and data‑engineering pipelines at Flowe, conducted large‑scale social‑media research at SmartData@PoliTO, and led projects at the NGO FAWLTS to narrow the education‑to‑employment gap.

Matteo also serves as a member of GlobalAI, the Swiss‑based non‑profit that represents AI stakeholders before the United Nations and other international bodies, promoting the responsible, sustainable and ethical development of artificial intelligence worldwide.

Interests
  • Natural Language Processing
  • Large Language Models
  • LLM Agents
  • Retrieval-Augmented Generation
  • Spatial Data Science
Education
  • Generative AI with Large Language Models MOOC, 2023

    DeepLearning.ai

  • The materiality of ESG Factors Specialisation, 2023

    The Wharton School

  • MSc in Data Science and Engineering, 2021

    Polytechnic University of Turin

  • BSc in Computer Engineering, 2019

    Polytechnic University of Turin

Skills

Technical Skills
Generative AI
Data Science
Retrieval-Augmented Generation
Soft Skills
Teamwork
Leadership
Problem Solving

Experience

 
 
 
 
 
Tinexta Group
AI Lead
March 2025 – Present Remote

AI Lead at Tinexta Group

  • Led the development of two Generative AI solutions that are now available (RAG, multi‑agent, graph‑based).
  • Communicate complex AI solutions clearly to clients, ensuring their needs are met and fostering strong, collaborative relationships.
  • Mentored two junior team members.
  • Led two classes introducing the use of LLMs to technical stakeholders.
 
 
 
 
 
Tinexta Group
Artificial Intelligence Specialist
April 2024 – Present Remote

Artificial Intelligence Specialist at Tinexta Group

  • Implemented multiple LLM Agent and Multiagent Solutions.
  • Improved the performance of our search pipelines by more than 50%.
  • Led the creation of an Elasticsearch database currently storing more than 100 million legal documents.
  • Created more than 20 high-performance Elasticsearch queries, reducing latency by more than 30% compared with the previous solutions.
 
 
 
 
 
DatAIMed
Cofounder and AI Lead
April 2025 – Present Remote

Co‑founded DatAIMed and shaped its AI vision, steering the company’s agent‑driven, evidence‑based‑medicine platform from concept to scale.

  • Set the AI strategy and roadmap: aligned proprietary product direction with clinical and market needs.
  • Led cross‑functional execution: coordinated engineering, data‑science and clinical teams to deliver agent‑powered workflows end‑to‑end.
  • Built a paper vector database: architected a multi‑collection MongoDB store for fast retrieval of peer‑reviewed literature.
  • Forged high‑impact partnerships: secured collaborations with leading partners, clinicians and organisations to validate and deploy our solutions.
  • Scaled a top‑tier AI org: hired, mentored and retained talent across research, MLE and product, cultivating a high‑performance culture.
 
 
 
 
 
www.matteovillosio.com
Freelancer
April 2025 – Present Remote

Provided consultancy and freelance work for:

  • SetUP Aosta
  • Tinexta Innovation Hub
 
 
 
 
 
ErisForge
Creator and Maintainer
October 2024 – Present Remote
ErisForge is a Python library designed to modify Large Language Models (LLMs) by applying transformations to their internal layers. Named after Eris, the goddess of strife and discord, ErisForge allows you to alter model behavior in a controlled manner, creating both ablated and augmented versions of LLMs that respond differently to specific types of input.
 
 
 
 
 
Global AI Association
Member
April 2024 – Present Geneva, Geneva Canton, Switzerland

GlobalAI, an international association that represents AI players (both developers and users) before international institutions, was established in 2023 as a non-profit entity under Swiss law.

  • GlobalAI aims to promote the positive development, sustainable growth and ethical application of Artificial Intelligence technology worldwide.
  • Its primary goal is to empower its Members with representation and voice at the United Nations and other pertinent global entities, in order to foster a conducive and unified regulatory environment.
 
 
 
 
 
Greenomy
NLP Data Scientist
November 2022 – April 2024 Brussels, Brussels Region, Belgium (Hybrid)

First Data Scientist hired at Greenomy

  • Created the first deep NLP system at Greenomy.
  • Led the development of Artemis, the AI Advisor by Greenomy.
  • Won 2nd place at the Swift Hackathon “Let’s innovate for a sustainable future”.
  • Implemented multiple LLM use cases.
  • Developed LLM-based, database-connected chatbots.
  • Onboarded and mentored new joiners.
 
 
 
 
 
Generali Insurance
Group Audit Data Scientist
April 2022 – November 2022 Milan, Lombardy, Italy
  • Responsible for data science operations at Group Audit.
  • Design and implementation of Machine Learning, Deep Learning and NLP architectures applied to Auditing and Control functions.
  • Exploration of structured and unstructured data and ETL pipeline design.
 
 
 
 
 
Flowe - Mediolanum Bank
Junior Data Engineer
September 2021 – April 2022 Milan, Lombardy, Italy
  • Researched and implemented the first completely in-house Deep Learning model.
  • Research and Development of Machine Learning and Deep Learning Models
  • Data Exploration and Analysis
  • ETL procedures
 
 
 
 
 
FAWLTS
Head of Hub and Core Team Member
January 2022 – Present Pinerolo, Piedmont, Italy (Remote)
  • FAWLTS is an NGO that bridges the gap between schools and the job market, helping students understand what they want to do when they grow up.
  • Organised events and activities.
  • Managed a team of more than 10 people.
  • Organised and led multiple events and activities with more than 200 participants.
 
 
 
 
 
StAI Tuned
Editorial Contributor
June 2022 – Present Remote
  • Contributed insightful articles on practical applications of Artificial Intelligence, generating thousands of views and fostering community engagement.
  • Engaged in knowledge-sharing and discussions within an open AI community, helping to demystify AI concepts and applications for a wider audience.
  • Played a key role in creating and nurturing a space for AI enthusiasts and professionals to connect and share knowledge.
 
 
 
 
 
SmartData@PoliTO
Master Thesis Researcher
March 2021 – December 2021 Turin, Piedmont, Italy
  • Developed and evaluated two advanced algorithms, a Random Forest Regressor and a Recurrent Neural Network, for forecasting popularity trends in Online Social Networks, utilizing a dataset comprising over 2 million posts from 1611 Italian Instagram influencer profiles.
  • Engaged in data-driven analysis, focusing on understanding the interactions and popularity dynamics within social media platforms, and the impact of various factors on content popularity.
  • Implemented various data science and machine learning techniques, including data exploration, model development, and performance evaluation, using tools and technologies such as Python, PySpark, MySQL, and more.
  • Achieved satisfactory prediction results despite the challenges posed by data limitations, extreme metric variances, and the presence of numerous outliers, contributing meaningful insights into the field of social media analytics and popularity forecasting.
 
 
 
 
 
EiSWORLD (now Orbyta)
Database Developer and Full Stack Developer Intern
April 2019 – June 2019 Turin, Italy
  • Engaged in full-stack development and database management, contributing to various projects during the internship period.
  • Worked with a stack including Flask, MySQL, SQL, Ubuntu, and Python, implementing robust and efficient database solutions.
  • Utilized data modeling techniques and worked with relational databases to ensure data integrity and optimal performance.
  • Collaborated with cross-functional teams, contributing to the development lifecycle from database design to full-stack implementation and optimization.
 
 
 
 
 
GEC - Giochi Elettronici Competitivi
News Editor
February 2015 – March 2015 Remote
  • Specialized in crafting engaging and informative news articles and editorials focused on the videogame and E-Sports industry.
  • Conducted thorough research to stay abreast of the latest trends and developments within the gaming community.
  • Contributed to the editorial team by consistently delivering high-quality content, enhancing the publication’s reputation and reader engagement.
  • Utilized strong writing skills to break down complex gaming topics for a diverse audience, promoting a deeper understanding of E-Sports and gaming culture.

Projects

ErisForge
ErisForge: Customizing LLM Behaviors for Enhanced Control and Research.
The Unofficial Haystack Documentation Chatbot
A simple agent chatbot that you can use to chat with the documentation of the famous NLP framework Haystack.
Kompy
An easy-to-use Python Wrapper for Komoot APIs.
Pic Selector
A small Streamlit app for selecting holiday pictures, with a UX similar to Tinder.
Data-driven Analysis of Interactions and Popularity Increase in Online Social Networks
The research analyzes popularity trends among influencers. Employing both classical ML algorithms and Neural Networks on data from 1611 profiles (2015-2021), it achieves satisfactory predictions of post reactions, despite some data limitations.
MixCarl a Novel Approach to iCaRL and Incremental Learning
This paper explores incremental learning in machine learning, aiming to enhance system knowledge over time, crucial for IoT and social media applications. The authors review and implement state-of-the-art methodologies, including iCaRL, and propose new variations for specific use cases. They evaluate catastrophic forgetting, establish benchmarks, and seek to improve performance in continuous learning scenarios.
Simulated Annealing for Optimal Scheduling of University Exams
A project for the course of Operational Research at Politecnico di Torino. The goal was to find the optimal schedule for the exams of the Computer Engineering course.