{"id":3910,"date":"2025-06-23T16:25:41","date_gmt":"2025-06-23T14:25:41","guid":{"rendered":"https:\/\/exceltic.com\/?p=3910"},"modified":"2026-03-19T13:28:24","modified_gmt":"2026-03-19T12:28:24","slug":"ia-generative-avec-controle-total-petits-modeles-linguistiques-slms-et-quantification-2","status":"publish","type":"post","link":"https:\/\/exceltic.serquo.com\/fr\/ia-generative-avec-controle-total-petits-modeles-linguistiques-slms-et-quantification-2\/","title":{"rendered":"IA g\u00e9n\u00e9rative avec contr\u00f4le total : petits mod\u00e8les de langage (SLM) et quantification"},"content":{"rendered":"<p class=\"has-medium-font-size\">La g\u00e9n\u00e9ration de l'IA a chang\u00e9 la fa\u00e7on dont nous interagissons avec la technologie, mais son utilisation g\u00e9n\u00e9ralis\u00e9e pr\u00e9sente des d\u00e9fis en termes de protection de la vie priv\u00e9e, de gouvernance et d'efficacit\u00e9 des ressources. Dans cet article, nous examinons comment les petits mod\u00e8les de langage (SLM) offrent une alternative plus contr\u00f4l\u00e9e et plus efficace aux grands mod\u00e8les de langage (LLM) et comment la quantification des mod\u00e8les peut encore am\u00e9liorer leurs performances.<\/p>\n\n\n\n<h2 class=\"wp-block-heading has-tertiary-color has-text-color has-link-color has-large-font-size wp-elements-e76b0c190fbc11e2b681060fe0464846\">Qu'est-ce qu'un SLM ?<\/h2>\n\n\n\n<p class=\"has-medium-font-size\">Les petits mod\u00e8les de langage (SLM) sont des mod\u00e8les de langage form\u00e9s avec une architecture similaire aux LLM mais avec un nombre de param\u00e8tres beaucoup plus petit pour le traitement du langage naturel, la compr\u00e9hension et la g\u00e9n\u00e9ration de contenu.<\/p>\n\n\n\n<p class=\"has-medium-font-size\">Tels sont les avantages des vari\u00e9t\u00e9s :<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"has-medium-font-size\"><strong>Faible consommation de ressources (RAM\/CPU\/GPU)<\/strong>Ils n\u00e9cessitent moins de puissance de traitement, ce qui facilite leur mise en \u0153uvre sur du mat\u00e9riel plus abordable.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Un meilleur contr\u00f4le<\/strong>peut \u00eatre utilis\u00e9 \u00e0 la fois dans des environnements locaux et priv\u00e9s. <\/li>\n\n\n\n<li class=\"has-medium-font-size\">garantit une s\u00e9curit\u00e9 et une gouvernance accrues.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Augmentation de la vitesse de d\u00e9duction<\/strong>Les r\u00e9ponses sont g\u00e9n\u00e9r\u00e9es plus rapidement et plus efficacement car il y a moins de param\u00e8tres.<\/li>\n<\/ul>\n\n\n\n<p class=\"has-medium-font-size\">Exemples de SLM connus : <\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"has-medium-font-size\"><strong>Mistral 7B<\/strong><\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Phi-2<\/strong> de Microsoft (~2,7 milliards de param\u00e8tres)<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>TinyLLaMA<\/strong><\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Gemma 2B<\/strong> de Google<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading has-large-font-size\">Quantification des mod\u00e8les : r\u00e9duction des effectifs sans pr\u00e9cision de la mortalit\u00e9 <\/h2>\n\n\n\n<p class=\"has-medium-font-size\">L'un des principaux obstacles \u00e0 l'entra\u00eenement des mod\u00e8les d'IA est leur taille et la puissance de traitement requise. La quantification joue ici un r\u00f4le cl\u00e9. Cette technique permet de r\u00e9duire la taille du mod\u00e8le en convertissant les poids de haute pr\u00e9cision (FP32, FP16) en poids de faible pr\u00e9cision (INT8, INT4) sans affecter de mani\u00e8re significative les performances du mod\u00e8le. Les avantages sont les suivants :<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"has-medium-font-size\"><strong>R\u00e9duction de l'utilisation de la m\u00e9moire<\/strong>Permet de stocker et de g\u00e9rer des mod\u00e8les sur des appareils \u00e0 capacit\u00e9 limit\u00e9e.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Efficacit\u00e9 accrue du GPU\/CPU :<\/strong> R\u00e9duit la charge du processeur en acc\u00e9l\u00e9rant les op\u00e9rations math\u00e9matiques.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Inf\u00e9rence acc\u00e9l\u00e9r\u00e9e :<\/strong> Les mod\u00e8les peuvent r\u00e9agir plus rapidement, ce qui r\u00e9duit la pr\u00e9cision des calculs.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading has-large-font-size\">Comparaison entre SLM et LLM<\/h2>\n\n\n\n<p class=\"has-medium-font-size\">Si la LLM s'est av\u00e9r\u00e9e \u00eatre un outil puissant, la SLM offre des avantages importants dans les situations o\u00f9 l'efficacit\u00e9 et la protection de la vie priv\u00e9e sont primordiales. Nous comparons ci-dessous les deux approches :<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"741\" height=\"426\" src=\"https:\/\/exceltic.com\/wp-content\/uploads\/2025\/06\/tabla-comparacion-SLM-y-LLM.png\" alt=\"\" class=\"wp-image-3911\" srcset=\"https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/tabla-comparacion-SLM-y-LLM.png 741w, https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/tabla-comparacion-SLM-y-LLM-300x172.png 300w, https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/tabla-comparacion-SLM-y-LLM-18x10.png 18w\" sizes=\"auto, (max-width: 741px) 100vw, 741px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading has-large-font-size\">G\u00e9n\u00e9ration augment\u00e9e de r\u00e9cup\u00e9ration (RAG)<\/h2>\n\n\n\n<p class=\"has-medium-font-size\">La technique Retrieval Augmented Generation (RAG) est utilis\u00e9e pour am\u00e9liorer la pr\u00e9cision et la contextualisation des mod\u00e8les. Cette m\u00e9thode maximise les r\u00e9ponses en obtenant des informations \u00e0 partir de sources suppl\u00e9mentaires et am\u00e9liore le contexte avant la g\u00e9n\u00e9ration du texte. Sa structure est la suivante<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li class=\"has-medium-font-size\"><strong>Chunking<\/strong>fragmentation des donn\u00e9es en parties g\u00e9rables.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Encastrements de documents<\/strong>: conversion d'un texte en un tableau num\u00e9rique.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Base de donn\u00e9es vectorielles (VectorDB)<\/strong>: <strong>)<\/strong>est une base de donn\u00e9es qui stocke et extrait des donn\u00e9es pertinentes.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Recherche d'informations<\/strong>Lorsqu'il est confront\u00e9 \u00e0 une requ\u00eate, il localise les \u00e9l\u00e9ments d'information les plus pertinents.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>G\u00e9n\u00e9rer des r\u00e9ponses<\/strong>la synth\u00e8se d'informations contextualis\u00e9es afin d'am\u00e9liorer les r\u00e9sultats du mod\u00e8le.<\/li>\n<\/ol>\n\n\n\n<h2 class=\"wp-block-heading has-large-font-size\">Mise en \u0153uvre de SLMs + RAG : un mod\u00e8le efficace et s\u00e9curis\u00e9<\/h2>\n\n\n\n<p class=\"has-medium-font-size\">La combinaison des SLM et de la strat\u00e9gie RAG permet de cr\u00e9er des syst\u00e8mes d'IA g\u00e9n\u00e9rative hautement efficaces et contr\u00f4lables. Gr\u00e2ce \u00e0 cette architecture, les organisations peuvent utiliser des mod\u00e8les optimis\u00e9s qui garantissent une plus grande confidentialit\u00e9 tout en utilisant moins de ressources.<\/p>\n\n\n\n<h3 class=\"wp-block-heading has-medium-large-font-size\">Principaux avantages :<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"has-medium-font-size\">&nbsp;<strong>Utilisation optimis\u00e9e des donn\u00e9es<\/strong>L'inclusion de la recherche d'informations permet d'obtenir des r\u00e9ponses plus pr\u00e9cises et mieux inform\u00e9es.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Contr\u00f4le total du mod\u00e8le :<\/strong> \u00e9vite le recours \u00e0 des services tiers et permet de personnaliser le comportement de l'IA.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Ex\u00e9cution dans des environnements restreints :<\/strong> En raison de leur caract\u00e8re quantifiable et de leur taille r\u00e9duite, les SLM peuvent \u00eatre d\u00e9ploy\u00e9s sur des dispositifs p\u00e9riph\u00e9riques ou des serveurs locaux.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading has-large-font-size\">Architecture de base avec mod\u00e8le quantifi\u00e9, LangChain, RAG et FastAPI<\/h2>\n\n\n\n<p class=\"has-medium-font-size\">L'architecture suivante peut \u00eatre utilis\u00e9e pour cr\u00e9er un environnement SML efficace :<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li class=\"has-medium-font-size\"><strong>Chargement du mod\u00e8le quantifi\u00e9 :<\/strong> un mod\u00e8le pr\u00e9alablement quantifi\u00e9 dans INT8 ou INT4 est utilis\u00e9 pour optimiser ses performances.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>LangChain pour la gestion des invites<\/strong>LangChain : LangChain permet de structurer et d'\u00e9tendre les requ\u00eates du mod\u00e8le.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Utilisation de RAG pour une meilleure r\u00e9cup\u00e9ration<\/strong>a : utilise des bases de donn\u00e9es vectorielles pour am\u00e9liorer le contexte des r\u00e9ponses.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"> <strong>API REST avec FastAPI :<\/strong> ce mod\u00e8le explique comment utiliser une API pour faciliter l'int\u00e9gration avec d'autres applications.<\/li>\n<\/ol>\n\n\n\n<h3 class=\"wp-block-heading has-medium-large-font-size\">Exemple de code :<\/h3>\n\n\n\n<p class=\"has-medium-font-size\">from fastapi import FastAPI, HTTPException<\/p>\n\n\n\n<p class=\"has-medium-font-size\">from langchain.chains import RetrievalQA<\/p>\n\n\n\n<p class=\"has-medium-font-size\">from langchain.vectorstores import FAISS<\/p>\n\n\n\n<p class=\"has-medium-font-size\">from langchain.embeddings import HuggingFaceEmbeddings<\/p>\n\n\n\n<p class=\"has-medium-font-size\">from langchain.llms import HuggingFacePipeline<\/p>\n\n\n\n<p class=\"has-medium-font-size\">from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline<\/p>\n\n\n\n<p class=\"has-medium-font-size\">torche d'importation<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p class=\"has-medium-font-size\"># Mod\u00e8le quantifi\u00e9 de la charge<\/p>\n\n\n\n<p class=\"has-medium-font-size\">tokenizer = AutoTokenizer.from_pretrained(\"model-quantified\")<\/p>\n\n\n\n<p class=\"has-medium-font-size\">model = AutoModelForCausalLM.from_pretrained(\"model-quantified\", torch_dtype=torch.int8)<\/p>\n\n\n\n<p class=\"has-medium-font-size\">pipe = pipeline(\"text-generation\", model=model, tokenizer=tokenizer)<\/p>\n\n\n\n<p class=\"has-medium-font-size\">llm = HuggingFacePipeline(pipeline=pipe)<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p class=\"has-medium-font-size\"># Base de donn\u00e9es des vecteurs de charge pour RAG<\/p>\n\n\n\n<p class=\"has-medium-font-size\">embeddings = HuggingFaceEmbeddings(\"sentence-transformers\/all-MiniLM-L6-v2\")<\/p>\n\n\n\n<p class=\"has-medium-font-size\">db = FAISS.load_local(\"ruta_vector_db\", embeddings)<\/p>\n\n\n\n<p class=\"has-medium-font-size\">retriever = db.as_retriever()<\/p>\n\n\n\n<p class=\"has-medium-font-size\">qa_chain = RetrievalQA(llm=llm, retriever=retriever)<\/p>\n\n\n\n<p class=\"has-medium-font-size\">app = FastAPI()<\/p>\n\n\n\n<p class=\"has-medium-font-size\">@app.post(\"\/generate\")<\/p>\n\n\n\n<p class=\"has-medium-font-size\">def generate_response(prompt : str, max_length : int = 100) :<\/p>\n\n\n\n<p class=\"has-medium-font-size\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; essayer :<\/p>\n\n\n\n<p class=\"has-medium-font-size\">&nbsp;&nbsp;&nbsp; &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; response = qa_chain.run(prompt)<\/p>\n\n\n\n<p class=\"has-medium-font-size\">&nbsp;&nbsp;&nbsp; &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; return {\"answer\" : answer}<\/p>\n\n\n\n<p class=\"has-medium-font-size\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; sauf Exception comme e :<\/p>\n\n\n\n<p class=\"has-medium-font-size\">&nbsp;&nbsp;&nbsp; &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; raise HTTPException(status_code=500, detail=str(e))<\/p>\n\n\n\n<p class=\"has-medium-font-size\">si __name__ == \"__main__\" :<\/p>\n\n\n\n<p class=\"has-medium-font-size\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; import uvicorn<\/p>\n\n\n\n<p class=\"has-medium-font-size\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; uvicorn.run(app, host=\"0.0.0.0.0\u2033, port=8000)<\/p>\n\n\n\n<h2 class=\"wp-block-heading has-large-font-size\">Explication<\/h2>\n\n\n\n<h3 class=\"wp-block-heading has-medium-large-font-size\">Chargement du mod\u00e8le quantifi\u00e9<\/h3>\n\n\n\n<p class=\"has-medium-font-size\"><em>tokenizer = AutoTokenizer.from_pretrained(\"model-quantified\")<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>model = AutoModelForCausalLM.from_pretrained(\"model-quantified\", torch_dtype=torch.int8)<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>pipe = pipeline(\"text-generation\", model=model, tokenizer=tokenizer)<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>llm = HuggingFacePipeline(pipeline=pipe)<\/em><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"has-medium-font-size\"><strong>AutoTokenizer.from_pretrained(\"model-quantified\") : <\/strong>Charge le tokeniser du mod\u00e8le quantifi\u00e9.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong> AutoModelForCausalLM.from_pretrained(\"model-quantified\", torch_dtype=torch.int8) : <\/strong>Charge le mod\u00e8le quantifi\u00e9 en pr\u00e9cision int8, ce qui r\u00e9duit l'utilisation de la m\u00e9moire et acc\u00e9l\u00e8re l'inf\u00e9rence.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>pipeline(\"text-generation\", model=model, tokenizer=tokenizer) :<\/strong> Cr\u00e9e un pipeline de g\u00e9n\u00e9ration de texte bas\u00e9 sur le mod\u00e8le quantifi\u00e9.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>HuggingFacePipeline(pipeline=pipe) :<\/strong> Int\u00e9grer le pipeline dans LangChain pour une utilisation ult\u00e9rieure dans l'architecture RAG.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading has-medium-large-font-size\">2. Configuration RAG (Retrieval-Augmented Generation) :<\/h3>\n\n\n\n<p class=\"has-medium-font-size\"><em>embeddings = HuggingFaceEmbeddings(\"sentence-transformers\/all-MiniLM-L6-v2\")<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>db = FAISS.load_local(\"ruta_vector_db\", embeddings)<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>retriever = db.as_retriever()<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>qa_chain = RetrievalQA(llm=llm, retriever=retriever<strong>)<\/strong><\/em><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"has-medium-font-size\"><strong>HuggingFaceEmbeddings(\"sentence-transformers\/all-MiniLM-L6-v2\") : <\/strong>Utilise un mod\u00e8le d'int\u00e9gration (MiniLM-L6-v2) pour convertir le texte en repr\u00e9sentations vectorielles.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>FAISS.load_local(\"path_vector_db\", embeddings) : <\/strong>Charge une base de donn\u00e9es vectorielles FAISS avec les encastrements pr\u00e9c\u00e9demment g\u00e9n\u00e9r\u00e9s.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>db.as_retriever() :<\/strong> Il transforme la base de donn\u00e9es en un moteur de recherche permettant d'extraire des informations pertinentes.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>RetrievalQA(llm=llm, retriever=retriever)<\/strong>Combine la mod\u00e9lisation quantifi\u00e9e du langage avec la recherche d'informations pour am\u00e9liorer la g\u00e9n\u00e9ration de r\u00e9ponses.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading has-medium-large-font-size\">3. Cr\u00e9ation de l'API avec FastAPI :<\/h3>\n\n\n\n<p class=\"has-medium-font-size\"><em>app = FastAPI()<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>@app.post(\"\/generate\")<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>def generate_response(prompt : str, max_length : int = 100) :<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; essayer :<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>&nbsp;&nbsp;&nbsp; &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; response = qa_chain.run(prompt)<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>&nbsp;&nbsp;&nbsp; &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; return {\"answer\" : answer}<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; sauf Exception comme e :<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>&nbsp;&nbsp;&nbsp; &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; raise HTTPException(status_code=500, detail=str(e))<\/em><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"has-medium-font-size\"><strong>FastAPI() : <\/strong>Cr\u00e9er une API REST pour exposer le mod\u00e8le et la fonctionnalit\u00e9 RAG.<strong>.<\/strong><\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>@app.post(\"\/generate\") :<\/strong> D\u00e9finit un point de terminaison \/generate qui accepte les requ\u00eates POST avec une invite de saisie.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>qa_chain.run(prompt) : <\/strong>Il utilise une combinaison de recherche d'informations (RAG) et de g\u00e9n\u00e9ration de texte pour r\u00e9pondre.<\/li>\n\n\n\n<li class=\"has-medium-font-size\"><strong>Gestion des exceptions :<\/strong> En cas d'erreur, un code HTTP 500 est renvoy\u00e9 avec le message d'erreur.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading has-medium-large-font-size\">4. Ex\u00e9cution du serveur :<\/h3>\n\n\n\n<p class=\"has-medium-font-size\"><em>si __name__ == \"__main__\" :<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>&nbsp;&nbsp;&nbsp; import uvicorn<\/em><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><em>&nbsp;&nbsp;&nbsp; uvicorn.run(app, host=\"0.0.0.0.0\u2033, port=8000)<\/em><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"has-medium-font-size\"><code><strong>uvicorn.run(app, host=\"0.0.0.0\", port=8000)<\/strong><\/code>: d\u00e9marre le serveur sur le port 8000, ce qui permet d'acc\u00e9der \u00e0 l'API.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading has-medium-large-font-size\">Conclusion<\/h3>\n\n\n\n<p class=\"has-medium-font-size\">Les entreprises et les d\u00e9veloppeurs peuvent d\u00e9sormais adopter <strong>des mod\u00e8les plus l\u00e9gers, plus rapides et plus priv\u00e9s gr\u00e2ce aux SLM (Small Language Models)<\/strong>une avanc\u00e9e majeure dans l'\u00e9volution de l'IA g\u00e9n\u00e9rative. Les <strong>quantification des mod\u00e8les<\/strong> -qui r\u00e9duit consid\u00e9rablement la taille et les besoins de calcul sans compromettre les performances de base - permet \u00e0 ces mod\u00e8les d'\u00eatre ex\u00e9cut\u00e9s dans des environnements <strong>sur place<\/strong> ou sur des appareils aux ressources limit\u00e9es, tout en conservant un contr\u00f4le total sur les donn\u00e9es et les processus.<\/p>\n\n\n\n<p class=\"has-medium-font-size\">Cette approche est compl\u00e9t\u00e9e par l'architecture bas\u00e9e sur <strong>RAG (Retrieval-Augmented Generation)<\/strong>avec des outils tels que <strong>FastAPI et LangChain<\/strong>Ces strat\u00e9gies permettent de d\u00e9ployer des solutions d'IA qui sont gouvernables, contr\u00f4lables et adapt\u00e9es \u00e0 des besoins sp\u00e9cifiques. Ces strat\u00e9gies permettent de <strong>g\u00e9n\u00e9ration d'IA enti\u00e8rement contr\u00f4l\u00e9e<\/strong>ce qui en fait un choix r\u00e9aliste et efficace pour les secteurs exigeants tels que l'analyse de donn\u00e9es, la recherche scientifique ou le service \u00e0 la client\u00e8le.<\/p>\n\n\n\n<p class=\"has-medium-font-size\">La combinaison de SLM quantifi\u00e9s, d'une architecture modulaire et d'un d\u00e9ploiement autonome repr\u00e9sente l'un des moyens les plus s\u00fbrs et les plus efficaces d'int\u00e9grer l'IA g\u00e9n\u00e9rative dans votre organisation.<\/p>\n\n\n\n<h3 class=\"wp-block-heading has-medium-large-font-size\">Vous voulez voir comment cela se traduit dans un cas concret ?<\/h3>\n\n\n\n<p class=\"has-medium-font-size\">Acc\u00e9der \u00e0 l'article complet et en savoir plus.<\/p>\n\n\n<div class=\"wpforms-container wpforms-container-full wpforms-block wpforms-block-0479e51e-0ca1-4f8c-8b97-091019a9064a\" id=\"wpforms-3927\"><form id=\"wpforms-form-3927\" class=\"wpforms-validate wpforms-form wpforms-ajax-form\" data-formid=\"3927\" method=\"post\" enctype=\"multipart\/form-data\" action=\"\/fr\/wp-json\/wp\/v2\/posts\/3910\" data-token=\"21e0f5bc03a6a79512f036010d75f312\" data-token-time=\"1776231466\" data-trp-original-action=\"\/fr\/wp-json\/wp\/v2\/posts\/3910\"><noscript class=\"wpforms-error-noscript\">Veuillez activer JavaScript dans votre navigateur pour remplir ce formulaire.<\/noscript><div class=\"wpforms-field-container\"><div id=\"wpforms-3927-field_1-container\" class=\"wpforms-field wpforms-field-text ocultar-urlASUNTO\" data-field-id=\"1\"><label class=\"wpforms-field-label\" for=\"wpforms-3927-field_1\">Sujet <span class=\"wpforms-required-label\">*<\/span><\/label><input type=\"text\" id=\"wpforms-3927-field_1\" class=\"wpforms-field-large wpforms-field-required\" name=\"wpforms[fields][1]\" value=\"IA Generativa con Control Total: Peque\u00f1os Modelos de Lenguaje (SLMs) y Cuantizaci\u00f3n\" required><\/div><div id=\"wpforms-3927-field_2-container\" class=\"wpforms-field wpforms-field-text\" data-field-id=\"2\"><label class=\"wpforms-field-label wpforms-label-hide\" for=\"wpforms-3927-field_2\">Nom <span class=\"wpforms-required-label\">*<\/span><\/label><input type=\"text\" id=\"wpforms-3927-field_2\" class=\"wpforms-field-large wpforms-field-required\" name=\"wpforms[fields][2]\" placeholder=\"Nom\" required><\/div><div id=\"wpforms-3927-field_3-container\" class=\"wpforms-field wpforms-field-text\" data-field-id=\"3\"><label class=\"wpforms-field-label wpforms-label-hide\" for=\"wpforms-3927-field_3\">Nom de famille <span class=\"wpforms-required-label\">*<\/span><\/label><input type=\"text\" id=\"wpforms-3927-field_3\" class=\"wpforms-field-large wpforms-field-required\" name=\"wpforms[fields][3]\" placeholder=\"Nom de famille\" required><\/div><div id=\"wpforms-3927-field_4-container\" class=\"wpforms-field wpforms-field-text\" data-field-id=\"4\"><label class=\"wpforms-field-label wpforms-label-hide\" for=\"wpforms-3927-field_4\"> Adresse \u00e9lectronique <span class=\"wpforms-required-label\">*<\/span><\/label><input type=\"text\" id=\"wpforms-3927-field_4\" class=\"wpforms-field-large wpforms-field-required\" name=\"wpforms[fields][4]\" placeholder=\"Adresse \u00e9lectronique\" required><\/div><div id=\"wpforms-3927-field_5-container\" class=\"wpforms-field wpforms-field-text\" data-field-id=\"5\"><label class=\"wpforms-field-label wpforms-label-hide\" for=\"wpforms-3927-field_5\">Num\u00e9ro de t\u00e9l\u00e9phone du contact <\/label><input type=\"text\" id=\"wpforms-3927-field_5\" class=\"wpforms-field-large\" name=\"wpforms[fields][5]\" placeholder=\"Num\u00e9ro de t\u00e9l\u00e9phone du contact\" ><\/div><div id=\"wpforms-3927-field_6-container\" class=\"wpforms-field wpforms-field-checkbox boxtransparent\" data-field-id=\"6\"><label class=\"wpforms-field-label wpforms-label-hide\">Acceptez-vous la politique de confidentialit\u00e9 ? <span class=\"wpforms-required-label\">*<\/span><\/label><ul id=\"wpforms-3927-field_6\" class=\"wpforms-field-required\"><li class=\"choice-1 depth-1\"><input type=\"checkbox\" id=\"wpforms-3927-field_6_1\" name=\"wpforms[fields][6][]\" value=\"He le\u00eddo y acepto la pol\u00edtica de privacidad\" required><label class=\"wpforms-field-label-inline\" for=\"wpforms-3927-field_6_1\">J'ai lu et j'accepte la politique de confidentialit\u00e9 <span class=\"wpforms-required-label\">*<\/span><\/label><\/li><\/ul><div class=\"wpforms-field-description wpforms-disclaimer-description\"><b>IMPORTANT :<\/b> Lire notre <a href='https:\/\/exceltic.com\/politica-de-privacidad\/' target='_blank' style=\"color: #e74b10;\" >Politique de confidentialit\u00e9<\/a> avant de poursuivre. Les informations que vous fournissez peuvent contenir des informations personnelles.<\/div><\/div><\/div><!-- .wpforms-field-container --><div class=\"wpforms-recaptcha-container wpforms-is-recaptcha wpforms-is-recaptcha-type-v2\" ><div class=\"g-recaptcha\" data-sitekey=\"6Lf9t3srAAAAAEstc-76vP4_LAvjdEbLQwgAri5D\"><\/div><input type=\"text\" name=\"g-recaptcha-hidden\" class=\"wpforms-recaptcha-hidden\" style=\"position:absolute!important;clip:rect(0,0,0,0)!important;height:1px!important;width:1px!important;border:0!important;overflow:hidden!important;padding:0!important;margin:0!important;\" data-rule-recaptcha=\"1\"><\/div><div class=\"wpforms-submit-container\" ><input type=\"hidden\" name=\"wpforms[id]\" value=\"3927\"><input type=\"hidden\" name=\"page_title\" value=\"\"><input type=\"hidden\" name=\"page_url\" value=\"https:\/\/exceltic.serquo.com\/fr\/wp-json\/wp\/v2\/posts\/3910\"><input type=\"hidden\" name=\"url_referer\" value=\"\"><button type=\"submit\" name=\"wpforms[submit]\" id=\"wpforms-submit-3927\" class=\"wpforms-submit\" data-alt-text=\"Enviando...\" data-submit-text=\"Enviar\" aria-live=\"assertive\" value=\"wpforms-submit\">Envoyer<\/button><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/exceltic.serquo.com\/wp-content\/plugins\/wpforms-lite\/assets\/images\/submit-spin.svg\" class=\"wpforms-submit-spinner\" style=\"display: none;\" width=\"26\" height=\"26\" alt=\"Chargement en cours\"><\/div><input type=\"hidden\" name=\"trp-form-language\" value=\"fr\"\/><\/form><\/div>  <!-- .wpforms-container -->","protected":false},"excerpt":{"rendered":"<p>Mettre en \u0153uvre l'IA g\u00e9n\u00e9rative avec des mod\u00e8les quantifi\u00e9s de petite taille. Gagner en efficacit\u00e9, en confidentialit\u00e9 et en contr\u00f4le total avec SLM et RAG dans des environnements s\u00e9curis\u00e9s.<\/p>","protected":false},"author":1,"featured_media":3916,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3],"tags":[],"class_list":["post-3910","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-eventos"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.9 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>IA Generativa con Control Total: Peque\u00f1os Modelos de Lenguaje (SLMs) y Cuantizaci\u00f3n -<\/title>\n<meta name=\"description\" content=\"Implementa IA generativa con modelos peque\u00f1os y cuantizados. Gana eficiencia, privacidad y control total con SLM y RAG en entornos seguros.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/exceltic.serquo.com\/fr\/ia-generative-avec-controle-total-petits-modeles-linguistiques-slms-et-quantification-2\/\" \/>\n<meta property=\"og:locale\" content=\"fr_FR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"IA Generativa con Control Total: Peque\u00f1os Modelos de Lenguaje (SLMs) y Cuantizaci\u00f3n -\" \/>\n<meta property=\"og:description\" content=\"Implementa IA generativa con modelos peque\u00f1os y cuantizados. Gana eficiencia, privacidad y control total con SLM y RAG en entornos seguros.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/exceltic.serquo.com\/fr\/ia-generative-avec-controle-total-petits-modeles-linguistiques-slms-et-quantification-2\/\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/exceltic\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-06-23T14:25:41+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-03-19T12:28:24+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/portada-blog-ia-generativa.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1920\" \/>\n\t<meta property=\"og:image:height\" content=\"1080\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Serquo Admin\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@exceltic\" \/>\n<meta name=\"twitter:site\" content=\"@exceltic\" \/>\n<meta name=\"twitter:label1\" content=\"\u00c9crit par\" \/>\n\t<meta name=\"twitter:data1\" content=\"Serquo Admin\" \/>\n\t<meta name=\"twitter:label2\" content=\"Dur\u00e9e de lecture estim\u00e9e\" \/>\n\t<meta name=\"twitter:data2\" content=\"8 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/\"},\"author\":{\"name\":\"Serquo Admin\",\"@id\":\"https:\/\/exceltic.serquo.com\/#\/schema\/person\/aa4f97f58379cc64179590f276472ad5\"},\"headline\":\"IA Generativa con Control Total: Peque\u00f1os Modelos de Lenguaje (SLMs) y Cuantizaci\u00f3n\",\"datePublished\":\"2025-06-23T14:25:41+00:00\",\"dateModified\":\"2026-03-19T12:28:24+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/\"},\"wordCount\":1557,\"image\":{\"@id\":\"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/portada-blog-ia-generativa.png\",\"articleSection\":[\"Eventos\"],\"inLanguage\":\"fr-FR\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/\",\"url\":\"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/\",\"name\":\"IA Generativa con Control Total: Peque\u00f1os Modelos de Lenguaje (SLMs) y Cuantizaci\u00f3n -\",\"isPartOf\":{\"@id\":\"https:\/\/exceltic.serquo.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/portada-blog-ia-generativa.png\",\"datePublished\":\"2025-06-23T14:25:41+00:00\",\"dateModified\":\"2026-03-19T12:28:24+00:00\",\"author\":{\"@id\":\"https:\/\/exceltic.serquo.com\/#\/schema\/person\/aa4f97f58379cc64179590f276472ad5\"},\"description\":\"Implementa IA generativa con modelos peque\u00f1os y cuantizados. Gana eficiencia, privacidad y control total con SLM y RAG en entornos seguros.\",\"breadcrumb\":{\"@id\":\"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#breadcrumb\"},\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#primaryimage\",\"url\":\"https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/portada-blog-ia-generativa.png\",\"contentUrl\":\"https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/portada-blog-ia-generativa.png\",\"width\":1920,\"height\":1080},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Portada\",\"item\":\"https:\/\/exceltic.serquo.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"IA Generativa con Control Total: Peque\u00f1os Modelos de Lenguaje (SLMs) y Cuantizaci\u00f3n\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/exceltic.serquo.com\/#website\",\"url\":\"https:\/\/exceltic.serquo.com\/\",\"name\":\"\",\"description\":\"Ingenier\u00eda y Consultor\u00eda\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/exceltic.serquo.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"fr-FR\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/exceltic.serquo.com\/#\/schema\/person\/aa4f97f58379cc64179590f276472ad5\",\"name\":\"Serquo Admin\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/exceltic.serquo.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/c335bd9da7b1e9a671364c54e17ba39cb4a646b8eb953383973649c257b1fe49?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/c335bd9da7b1e9a671364c54e17ba39cb4a646b8eb953383973649c257b1fe49?s=96&d=mm&r=g\",\"caption\":\"Serquo Admin\"},\"url\":\"https:\/\/exceltic.serquo.com\/fr\/author\/serquo\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"IA g\u00e9n\u00e9rative avec contr\u00f4le total : petits mod\u00e8les de langage (SLM) et quantification","description":"Mettre en \u0153uvre l'IA g\u00e9n\u00e9rative avec des mod\u00e8les quantifi\u00e9s de petite taille. Gagner en efficacit\u00e9, en confidentialit\u00e9 et en contr\u00f4le total avec SLM et RAG dans des environnements s\u00e9curis\u00e9s.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/exceltic.serquo.com\/fr\/ia-generative-avec-controle-total-petits-modeles-linguistiques-slms-et-quantification-2\/","og_locale":"fr_FR","og_type":"article","og_title":"IA Generativa con Control Total: Peque\u00f1os Modelos de Lenguaje (SLMs) y Cuantizaci\u00f3n -","og_description":"Implementa IA generativa con modelos peque\u00f1os y cuantizados. Gana eficiencia, privacidad y control total con SLM y RAG en entornos seguros.","og_url":"https:\/\/exceltic.serquo.com\/fr\/ia-generative-avec-controle-total-petits-modeles-linguistiques-slms-et-quantification-2\/","article_publisher":"https:\/\/www.facebook.com\/exceltic\/","article_published_time":"2025-06-23T14:25:41+00:00","article_modified_time":"2026-03-19T12:28:24+00:00","og_image":[{"width":1920,"height":1080,"url":"https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/portada-blog-ia-generativa.png","type":"image\/png"}],"author":"Serquo Admin","twitter_card":"summary_large_image","twitter_creator":"@exceltic","twitter_site":"@exceltic","twitter_misc":{"\u00c9crit par":"Serquo Admin","Dur\u00e9e de lecture estim\u00e9e":"8 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#article","isPartOf":{"@id":"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/"},"author":{"name":"Serquo Admin","@id":"https:\/\/exceltic.serquo.com\/#\/schema\/person\/aa4f97f58379cc64179590f276472ad5"},"headline":"IA Generativa con Control Total: Peque\u00f1os Modelos de Lenguaje (SLMs) y Cuantizaci\u00f3n","datePublished":"2025-06-23T14:25:41+00:00","dateModified":"2026-03-19T12:28:24+00:00","mainEntityOfPage":{"@id":"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/"},"wordCount":1557,"image":{"@id":"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#primaryimage"},"thumbnailUrl":"https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/portada-blog-ia-generativa.png","articleSection":["Eventos"],"inLanguage":"fr-FR"},{"@type":"WebPage","@id":"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/","url":"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/","name":"IA g\u00e9n\u00e9rative avec contr\u00f4le total : petits mod\u00e8les de langage (SLM) et quantification","isPartOf":{"@id":"https:\/\/exceltic.serquo.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#primaryimage"},"image":{"@id":"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#primaryimage"},"thumbnailUrl":"https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/portada-blog-ia-generativa.png","datePublished":"2025-06-23T14:25:41+00:00","dateModified":"2026-03-19T12:28:24+00:00","author":{"@id":"https:\/\/exceltic.serquo.com\/#\/schema\/person\/aa4f97f58379cc64179590f276472ad5"},"description":"Mettre en \u0153uvre l'IA g\u00e9n\u00e9rative avec des mod\u00e8les quantifi\u00e9s de petite taille. Gagner en efficacit\u00e9, en confidentialit\u00e9 et en contr\u00f4le total avec SLM et RAG dans des environnements s\u00e9curis\u00e9s.","breadcrumb":{"@id":"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#breadcrumb"},"inLanguage":"fr-FR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/"]}]},{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#primaryimage","url":"https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/portada-blog-ia-generativa.png","contentUrl":"https:\/\/exceltic.serquo.com\/wp-content\/uploads\/2025\/06\/portada-blog-ia-generativa.png","width":1920,"height":1080},{"@type":"BreadcrumbList","@id":"https:\/\/exceltic.serquo.com\/ia-generativa-con-control-total-pequenos-modelos-de-lenguaje-slms-y-cuantizacion-2\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Portada","item":"https:\/\/exceltic.serquo.com\/"},{"@type":"ListItem","position":2,"name":"IA Generativa con Control Total: Peque\u00f1os Modelos de Lenguaje (SLMs) y Cuantizaci\u00f3n"}]},{"@type":"WebSite","@id":"https:\/\/exceltic.serquo.com\/#website","url":"https:\/\/exceltic.serquo.com\/","name":"","description":"Ing\u00e9nierie et conseil","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/exceltic.serquo.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"fr-FR"},{"@type":"Person","@id":"https:\/\/exceltic.serquo.com\/#\/schema\/person\/aa4f97f58379cc64179590f276472ad5","name":"Serquo Admin","image":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/exceltic.serquo.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/c335bd9da7b1e9a671364c54e17ba39cb4a646b8eb953383973649c257b1fe49?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/c335bd9da7b1e9a671364c54e17ba39cb4a646b8eb953383973649c257b1fe49?s=96&d=mm&r=g","caption":"Serquo Admin"},"url":"https:\/\/exceltic.serquo.com\/fr\/author\/serquo\/"}]}},"_links":{"self":[{"href":"https:\/\/exceltic.serquo.com\/fr\/wp-json\/wp\/v2\/posts\/3910","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/exceltic.serquo.com\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/exceltic.serquo.com\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/exceltic.serquo.com\/fr\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/exceltic.serquo.com\/fr\/wp-json\/wp\/v2\/comments?post=3910"}],"version-history":[{"count":10,"href":"https:\/\/exceltic.serquo.com\/fr\/wp-json\/wp\/v2\/posts\/3910\/revisions"}],"predecessor-version":[{"id":3986,"href":"https:\/\/exceltic.serquo.com\/fr\/wp-json\/wp\/v2\/posts\/3910\/revisions\/3986"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/exceltic.serquo.com\/fr\/wp-json\/wp\/v2\/media\/3916"}],"wp:attachment":[{"href":"https:\/\/exceltic.serquo.com\/fr\/wp-json\/wp\/v2\/media?parent=3910"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/exceltic.serquo.com\/fr\/wp-json\/wp\/v2\/categories?post=3910"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/exceltic.serquo.com\/fr\/wp-json\/wp\/v2\/tags?post=3910"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}