{"id":95610,"date":"2024-02-08T16:25:33","date_gmt":"2024-02-08T16:25:33","guid":{"rendered":"https:\/\/www.artefact.com\/?post_type=blog&#038;p=95610"},"modified":"2024-09-20T17:46:02","modified_gmt":"2024-09-20T16:46:02","slug":"why-you-need-llmops","status":"publish","type":"blog","link":"https:\/\/www.artefact.com\/nl\/blog\/why-you-need-llmops\/","title":{"rendered":"Waarom u LLMOps nodig hebt"},"content":{"rendered":"<p><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-1 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling article-author\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-background-color:#ffffff;--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-0 fusion_builder_column_1_2 1_2 fusion-flex-column\" style=\"--awb-bg-size:cover;--awb-width-large:50%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:50%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\"><div class=\"fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column\"><div class=\"fusion-title title fusion-title-1 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Auteur<\/h2><\/div><img decoding=\"async\" src=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%27150%27%20height%3D%270%27%20viewBox%3D%270%200%20150%200%27%3E%3Crect%20width%3D%27150%27%20height%3D%270%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/meryam-assermouh.jpg\" alt=\"Image\" class=\"lazyload artefact-elegant-image align-left article-author-image\" style=\"width: 150px; border-radius: 54% 46% 77% 23% \/ 74% 40% 60% 26%; overflow: hidden;\" width=\"150\" height=\"auto\" \/><div class=\"fusion-title title fusion-title-2 fusion-sep-none fusion-title-text fusion-title-size-three article-author-name-title\" style=\"--awb-text-color:var(--awb-color7);--awb-margin-bottom-small:8px;--awb-font-size:18px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"font-family:&quot;Josefin Sans&quot;;font-style:normal;font-weight:600;margin:0;font-size:1em;--fontSize:18;line-height:1.5;\">Meryam Assermouh<\/h3><\/div><div class=\"fusion-text fusion-text-1 article-author-description\" style=\"--awb-font-size:14px;--awb-line-height:1.6;--awb-letter-spacing:2px;--awb-text-transform:uppercase;--awb-text-color:var(--awb-color7);--awb-text-font-family:&quot;Roboto&quot;;--awb-text-font-style:normal;--awb-text-font-weight:400;\"><p>Data Engineer <a href=\"https:\/\/www.artefact.com\/nl\/\">bij Artefact Frankrijk<\/a><\/p>\n<\/div><\/div><\/div><\/div><\/div><article class=\"fusion-fullwidth fullwidth-box fusion-builder-row-2 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--link_color: var(--awb-color6);--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-background-color:var(--awb-color1);--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-justify-content-center fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-1 fusion_builder_column_1_1 1_1 fusion-flex-column\" style=\"--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\"><div class=\"fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column\"><div class=\"fusion-title title fusion-title-3 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-text-color:var(--awb-color6);--awb-margin-bottom-small:8px;--awb-font-size:30px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"font-family:&quot;PT Serif&quot;;font-style:normal;font-weight:700;margin:0;letter-spacing:1.6px;font-size:1em;--fontSize:30;line-height:1.47;\">TL;DR<\/h2><\/div><div class=\"fusion-text fusion-text-2\" style=\"--awb-font-size:20px;--awb-line-height:1.6;--awb-letter-spacing:var(--awb-typography4-letter-spacing);--awb-text-transform:var(--awb-typography4-text-transform);--awb-text-color:var(--awb-color5);--awb-text-font-family:var(--awb-typography4-font-family);--awb-text-font-weight:var(--awb-typography4-font-weight);--awb-text-font-style:var(--awb-typography4-font-style);\"><p>Dit artikel introduceert LLMOps, een gespecialiseerde tak die DevOps en MLOps samenvoegt voor <strong>De uitdagingen van grote taalmodellen beheren<\/strong> (LLM's). LLM's, zoals GPT van OpenAI, gebruiken uitgebreide tekst data voor taken zoals tekstgeneratie en taalvertaling. LLMOps pakt problemen aan zoals <strong>aanpassing, API-wijzigingen, data drift, modelevaluatie en bewaking<\/strong> via tools zoals LangSmith, TruLens en W&amp;B Prompts. Het zorgt voor aanpasbaarheid, evaluatie en bewaking van LLM's in real-world scenario's, en biedt een allesomvattende oplossing voor organisaties die gebruik maken van deze geavanceerde taalmodellen.<\/p>\n<\/div><div class=\"fusion-text fusion-text-3\" style=\"--awb-font-size:20px;--awb-line-height:1.6;--awb-letter-spacing:var(--awb-typography4-letter-spacing);--awb-text-transform:var(--awb-typography4-text-transform);--awb-text-color:var(--awb-color5);--awb-text-font-family:var(--awb-typography4-font-family);--awb-text-font-weight:var(--awb-typography4-font-weight);--awb-text-font-style:var(--awb-typography4-font-style);\"><p>Om u door deze discussie te leiden, gaan we eerst in op de basisprincipes van DevOps en MLOps, waarna we ons richten op LLMOps, te beginnen met een korte introductie van LLM's en het gebruik ervan door organisaties. Vervolgens gaan we dieper in op de belangrijkste operationele uitdagingen van LLM-technologie en hoe LLMOps deze effectief aanpakt.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-4 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-text-color:var(--awb-color6);--awb-margin-bottom-small:8px;--awb-font-size:30px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"font-family:&quot;PT Serif&quot;;font-style:normal;font-weight:700;margin:0;letter-spacing:1.6px;font-size:1em;--fontSize:30;line-height:1.47;\">Basisprincipes voor LLMOps: DevOps en MLOps<\/h2><\/div><div class=\"fusion-text fusion-text-4\" style=\"--awb-font-size:20px;--awb-line-height:1.6;--awb-letter-spacing:var(--awb-typography4-letter-spacing);--awb-text-transform:var(--awb-typography4-text-transform);--awb-text-color:var(--awb-color5);--awb-text-font-family:var(--awb-typography4-font-family);--awb-text-font-weight:var(--awb-typography4-font-weight);--awb-text-font-style:var(--awb-typography4-font-style);\"><p>DevOps, een afkorting van Development and Operations, is een verzameling praktijken die tot doel hebben het softwareleveringsproces te automatiseren, waardoor het effici\u00ebnter, betrouwbaarder en schaalbaarder wordt. De kernprincipes van DevOps zijn onder andere: samenwerking, automatisering, continu testen, bewaking en implementatie-orkestratie.<\/p>\n<\/div><div class=\"fusion-text fusion-text-5\" style=\"--awb-font-size:20px;--awb-line-height:1.6;--awb-letter-spacing:var(--awb-typography4-letter-spacing);--awb-text-transform:var(--awb-typography4-text-transform);--awb-text-color:var(--awb-color5);--awb-text-font-family:var(--awb-typography4-font-family);--awb-text-font-weight:var(--awb-typography4-font-weight);--awb-text-font-style:var(--awb-typography4-font-style);\"><p>MLOps, kort voor Machine Learning Operations, is een uitbreiding van DevOps-praktijken die specifiek is afgestemd op het levenscyclusbeheer van modellen voor machinaal leren. Het richt zich op de unieke uitdagingen die de iteratieve en experimentele aard van de ontwikkeling van machine learning met zich meebrengt. Het introduceert extra taken zoals data versiebeheer, experimenteren en modeltraining.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-5 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-text-color:var(--awb-color6);--awb-margin-bottom-small:8px;--awb-font-size:30px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"font-family:&quot;PT Serif&quot;;font-style:normal;font-weight:700;margin:0;letter-spacing:1.6px;font-size:1em;--fontSize:30;line-height:1.47;\">LLMOps: Beheer van de implementatie en het onderhoud van grote taalmodellen<\/h2><\/div><div class=\"fusion-text fusion-text-6\"><p>LLMOps, kort voor Large Language Model Operations, is een gespecialiseerde tak van MLOps die speciaal ontworpen is om de unieke uitdagingen en vereisten van het beheren van grote taalmodellen (LLM's) aan te kunnen.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-6 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-text-color:var(--awb-color6);--awb-margin-bottom-small:8px;--awb-font-size:30px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"font-family:&quot;PT Serif&quot;;font-style:normal;font-weight:700;margin:0;letter-spacing:1.6px;font-size:1em;--fontSize:30;line-height:1.47;\">Maar eerst, wat zijn LLM's precies?<\/h2><\/div><div class=\"fusion-text fusion-text-7\"><p>LLM's zijn een soort deep learning-modellen die enorme hoeveelheden tekst data gebruiken om miljarden parameters te schatten. Dankzij deze parameters kunnen LLM's tekst van menselijke kwaliteit begrijpen en genereren, talen vertalen, complexe informatie samenvatten en verschillende taken op het gebied van natuurlijke taalverwerking uitvoeren.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-7 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-text-color:var(--awb-color6);--awb-margin-bottom-small:8px;--awb-font-size:30px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"font-family:&quot;PT Serif&quot;;font-style:normal;font-weight:700;margin:0;letter-spacing:1.6px;font-size:1em;--fontSize:30;line-height:1.47;\">Hoe organisaties LLM's gebruiken<\/h2><\/div><div class=\"fusion-text fusion-text-8\"><p>Omdat het trainen van LLM's vanaf nul extreem duur en tijdrovend is, kiezen organisaties voor voorgetrainde basismodellen, zoals GPT van OpenAI of LaMDA van Google AI, als uitgangspunt. Deze modellen, die al getraind zijn op grote hoeveelheden data, beschikken over uitgebreide kennis en kunnen verschillende taken uitvoeren, zoals het genereren van tekst, het vertalen van talen en het schrijven van verschillende soorten creatieve content. Om de output van de LLM verder aan te passen aan specifieke taken of domeinen, maken organisaties gebruik van technieken zoals prompt engineering, retrieval-augmented generation (RAG) en fine-tuning. Prompt engineering omvat het maken van duidelijke en beknopte instructies die de LLM naar het gewenste resultaat leiden, terwijl RAG het model baseert op aanvullende informatie van externe data bronnen, waardoor de prestaties en relevantie worden verbeterd. Bij fine-tuning worden de parameters van de LLM aangepast met behulp van extra data die specifiek zijn voor de behoeften van de organisatie. Het onderstaande schema geeft een overzicht van de LLMOps workflow en laat zien hoe deze technieken in het algemene proces ge\u00efntegreerd zijn.<\/p>\n<\/div><div class=\"fusion-image-element\" style=\"text-align:center;--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-1 hover-type-none\"><img decoding=\"async\" width=\"1336\" height=\"463\" title=\"sch\u00e9ma LLMOps medium artikel\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-LLMOps-medium-article.png\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-LLMOps-medium-article.png\" alt class=\"lazyload img-responsive wp-image-95612\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271336%27%20height%3D%27463%27%20viewBox%3D%270%200%201336%20463%27%3E%3Crect%20width%3D%271336%27%20height%3D%27463%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-LLMOps-medium-article-200x69.png 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-LLMOps-medium-article-400x139.png 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-LLMOps-medium-article-600x208.png 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-LLMOps-medium-article-800x277.png 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-LLMOps-medium-article-1200x416.png 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-LLMOps-medium-article.png 1336w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1336px\" \/><\/span><\/div><div class=\"fusion-title title fusion-title-8 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-text-color:var(--awb-color6);--awb-margin-bottom-small:8px;--awb-font-size:30px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"font-family:&quot;PT Serif&quot;;font-style:normal;font-weight:700;margin:0;letter-spacing:1.6px;font-size:1em;--fontSize:30;line-height:1.47;\">Waarom we LLMOps nodig hebben<\/h2><\/div><div class=\"fusion-text fusion-text-9\"><p>De snelle vooruitgang in de LLM-technologie heeft verschillende operationele uitdagingen aan het licht gebracht die een gespecialiseerde aanpak vereisen.<\/p>\n<p>Enkele van deze uitdagingen zijn :<\/p>\n<\/div><ul style=\"--awb-iconcolor:var(--awb-color7);--awb-textcolor:var(--awb-color7);--awb-line-height:27.2px;--awb-icon-width:27.2px;--awb-icon-height:27.2px;--awb-icon-margin:11.2px;--awb-content-margin:38.4px;\" class=\"fusion-checklist fusion-checklist-1 fusion-checklist-default type-icons paddingList dark-text\"><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p><strong>De behoefte aan maatwerk<\/strong>: Hoewel LLM's vooraf getraind zijn op enorme hoeveelheden data, is aanpassing essentieel voor optimale prestaties bij specifieke taken. Dit heeft geleid tot de ontwikkeling van nieuwe aanpassingstechnieken, zoals <strong>snelle engineering<\/strong>, ophalen-ondersteund genereren (<strong>RAG<\/strong>) en <strong>fijnafstemming<\/strong>. RAG helpt het model om zich te baseren op de meest nauwkeurige informatie door het te voorzien van een externe kennisbank, terwijl fijnafstemming meer geschikt is als we willen dat het model specifieke taken uitvoert, of zich houdt aan een bepaald antwoordformaat zoals JSON of SQL. De keuze tussen RAG en fine-tuning hangt af van de vraag of we de kennis van het model willen vergroten of de prestaties ervan in een specifieke taak willen verbeteren.<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p><strong>API-wijzigingen<\/strong>: In tegenstelling tot traditionele ML-modellen, zijn LLM's vaak toegankelijk via API's van derden, die kunnen worden gewijzigd of zelfs afgeschaft, waardoor voortdurende controle en aanpassing nodig is. Bijvoorbeeld, <a href=\"https:\/\/platform.openai.com\/docs\/deprecations\" target=\"_blank\" rel=\"noopener\">Open AI's documentatie<\/a> vermeldt expliciet dat hun modellen regelmatig worden bijgewerkt, waardoor gebruikers mogelijk hun software moeten bijwerken of moeten migreren naar nieuwere modellen of eindpunten.<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p><strong>Data afwijking<\/strong>, verwijst naar een verschuiving in de statistische eigenschappen van input data, die vaak optreedt in de productie wanneer de aangetroffen data afwijkt van de data waarop de LLM's getraind zijn. Dit kan leiden tot het genereren van onnauwkeurige of verouderde informatie. Bij het GPT-3.5-model was de informatie bijvoorbeeld beperkt tot september 2021 voordat <a href=\"https:\/\/www.zdnet.com\/article\/chatgpt-is-no-longer-as-clueless-about-recent-events\/\" target=\"_blank\" rel=\"noopener\">de sluitingsdatum werd verlengd tot januari 2022<\/a>. Daardoor kon het vragen over recentere gebeurtenissen niet beantwoorden, wat leidde tot frustratie bij gebruikers.<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p><strong>Modelevaluatie<\/strong>: Bij traditioneel machinaal leren vertrouwen we op metrieken als acccuracy, precision en recall om onze modellen te beoordelen. Het evalueren van LLM's is echter aanzienlijk ingewikkelder, vooral bij het ontbreken van ground truth data en wanneer we te maken hebben met natuurlijke taaluitvoer in plaats van numerieke waarden.<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p><strong>Bewaking<\/strong>: Voortdurende bewaking van LLM's en op LLM gebaseerde toepassingen is van cruciaal belang. Het is ook gecompliceerder omdat er meerdere aspecten bij komen kijken die overwogen moeten worden om de algehele effectiviteit en betrouwbaarheid van deze taalmodellen te garanderen. We zullen deze aspecten in meer detail bespreken in het volgende gedeelte.<\/p>\n<\/div><\/li><\/ul><div class=\"fusion-title title fusion-title-9 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-text-color:var(--awb-color6);--awb-margin-bottom-small:8px;--awb-font-size:30px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"font-family:&quot;PT Serif&quot;;font-style:normal;font-weight:700;margin:0;letter-spacing:1.6px;font-size:1em;--fontSize:30;line-height:1.47;\">Hoe LLMOps deze uitdagingen aanpakt<\/h2><\/div><div class=\"fusion-text fusion-text-10\"><p>LLMOps bouwt voort op het fundament van MLOps en introduceert gespecialiseerde onderdelen op maat van LLM's :<\/p>\n<\/div><ul style=\"--awb-iconcolor:var(--awb-color7);--awb-textcolor:var(--awb-color7);--awb-line-height:27.2px;--awb-icon-width:27.2px;--awb-icon-height:27.2px;--awb-icon-margin:11.2px;--awb-content-margin:38.4px;\" class=\"fusion-checklist fusion-checklist-2 fusion-checklist-default type-icons\"><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p><strong>Snel engineering- en afstemmingsbeheer<\/strong>: LLMOps biedt hulpmiddelen zoals <strong>snelle versiebeheersystemen<\/strong> om verschillende versies van prompts bij te houden en te beheren. Het integreert ook met <strong>kaders verfijnen<\/strong> om het fine-tuning proces te automatiseren en te optimaliseren. Een prominent voorbeeld van deze hulpmiddelen is LangSmith, een framework dat speciaal is ontworpen voor het beheer van LLM-workflows. De uitgebreide functies omvatten <a href=\"https:\/\/docs.smith.langchain.com\/cookbook\/hub-examples\/retrieval-qa-chain-versioned\" target=\"_blank\" rel=\"noopener\">prompt versiebeheer<\/a>, waardoor gecontroleerde experimenten en reproduceerbaarheid mogelijk zijn. LangSmith vergemakkelijkt bovendien <a href=\"https:\/\/docs.smith.langchain.com\/cookbook\/fine-tuning-examples\" target=\"_blank\" rel=\"noopener\">fijnafstemming<\/a> van LLM's met runs'data na eventuele filtering en verrijking om de modelprestaties te verbeteren.<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p><strong>API-veranderingsbeheer<\/strong>: LLMOps stelt processen vast voor <strong>bewaking<\/strong> API wijzigingen, <strong>waarschuwen<\/strong> exploitanten voor potenti\u00eble onderbrekingen, en <strong>rollbacks inschakelen<\/strong> indien nodig.<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p><strong>Modelaanpassing aan veranderende data<\/strong>: LLMOps vergemakkelijkt de aanpassing van LLM's aan evoluerende data-landschappen, door ervoor te zorgen dat modellen relevant en performant blijven als data-patronen verschuiven. Dit kan worden bereikt door <strong>bewaking van data-distributies en het in gang zetten van aanpassingsprocessen<\/strong> wanneer er significante veranderingen worden gedetecteerd. Deze processen kunnen het volgende omvatten:<br \/>\n-&gt; <strong>Omscholing of afstemming<\/strong>: Afhankelijk van de mate van data drift en de beschikbare middelen, kan herscholing of fijnafstelling worden toegepast om de gevolgen te beperken.<br \/>\n-&gt; <strong>Domeinaanpassing<\/strong>: Fijnafstemming van de LLM op een dataset van het doeldomein.<br \/>\n-&gt; <strong>Destillatie van kennis<\/strong>: Een kleiner model trainen door gebruik te maken van de kennis en expertise van een groter, krachtiger, up-to-date model.<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p><strong>LLM-specifieke evaluatie<\/strong>: LLMOps gebruikt nieuwe evaluatie-instrumenten die aangepast zijn aan LLLM's. Deze omvatten:<br \/>\n-&gt; <strong>Op tekst gebaseerde statistieken<\/strong>, Zoals perplexiteit; een statistische maat voor hoe goed het model het volgende woord in een reeks kan voorspellen. Net als BLEU- en ROUGE-metriek, die machinaal gegenereerde tekst vergelijken met een of meer referentieteksten die door mensen zijn gegenereerd. Ze worden vaak gebruikt voor vertaal- en samenvat taken.<br \/>\n-&gt; <strong>Inbeddingen analyseren<\/strong> (vectorrepresentaties voor woorden of zinnen), om het vermogen van het model te beoordelen om contextspecifieke woorden te begrijpen en semantische overeenkomsten vast te leggen. Visualisatie- en clusteringstechnieken kunnen ons ook helpen bij het detecteren van vertekeningen.<br \/>\n-&gt; <strong>Evaluator LLM's<\/strong>: Andere LLM's gebruiken om ons model te evalueren. Dit kan bijvoorbeeld worden gedaan door een score toe te kennen aan de uitvoer van het ge\u00ebvalueerde model op basis van vooraf gedefinieerde metriek, zoals vloeiendheid, samenhang, relevantie en feitelijke nauwkeurigheid.<br \/>\n-&gt; <strong>Integratie van menselijke feedback<\/strong>: LLMOps bevat mechanismen voor het verzamelen en opnemen van menselijke feedback in de ML-levenscyclus, waardoor de prestaties van LLM worden verbeterd en vooroordelen worden aangepakt.<br \/>\n<a href=\"https:\/\/www.trulens.org\/trulens_eval\/core_concepts_feedback_functions\/#feedback-functions\" target=\"_blank\" rel=\"noopener\">TruLens<\/a> is een hulpmiddel dat integratie van deze evaluaties in LLM-toepassingen mogelijk maakt via een programmatische aanpak die bekend staat als Feedbackfuncties.<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p><strong>LLM-specifieke bewaking<\/strong>: LLMOps integreert continue bewaking om de prestatiecijfers van LLM bij te houden, mogelijke problemen te identificeren en conceptdrift of vooringenomenheid te detecteren. Dit omvat:<br \/>\n-&gt; <strong>Functionele bewaking<\/strong>; door het aantal aanvragen, de responstijd, het tokengebruik, het foutenpercentage en de kosten bij te houden.<br \/>\n-&gt; <strong>Prompt toezicht<\/strong>; om de leesbaarheid te garanderen en om toxiciteit en andere vormen van misbruik op te sporen. <a href=\"https:\/\/docs.wandb.ai\/guides\/prompts_platform\" target=\"_blank\" rel=\"noopener\">W&amp;B-prompts<\/a> is een set hulpmiddelen die is ontworpen voor het bewaken van LLM-gebaseerde toepassingen. Het kan worden gebruikt om de invoer en uitvoer van uw LLM's te analyseren, de tussenresultaten te bekijken en uw prompts veilig op te slaan en te beheren.<br \/>\n-&gt; <strong>Responscontrole<\/strong>; om de relevantie en consistentie van het model te garanderen. Dit omvat het voorkomen van het genereren van hallucinante of fictieve inhoud, evenals het garanderen van de uitsluiting van schadelijk of ongepast materiaal. Transparantie kan ons helpen om het antwoord van het model beter te begrijpen. Dit kan worden bewerkstelligd door antwoordbronnen te onthullen (in RAG) of door het model te vragen om zijn redenering te rechtvaardigen (denkketen).<\/p>\n<\/div><\/li><\/ul><div class=\"fusion-text fusion-text-11\"><p>Deze monitoring data kan gebruikt worden om de operationele effici\u00ebntie te verbeteren. We kunnen het kostenbeheer verbeteren door waarschuwingen over tokengebruik te implementeren en strategie\u00ebn toe te passen zoals het cachen van eerdere antwoorden. Hierdoor kunnen we deze hergebruiken voor soortgelijke query's zonder de LLM opnieuw aan te roepen. Daarnaast kunnen we de latentie minimaliseren door waar mogelijk voor kleinere modellen te kiezen en het aantal gegenereerde tokens te beperken.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-10 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-text-color:var(--awb-color6);--awb-margin-bottom-small:8px;--awb-font-size:30px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"font-family:&quot;PT Serif&quot;;font-style:normal;font-weight:700;margin:0;letter-spacing:1.6px;font-size:1em;--fontSize:30;line-height:1.47;\">Conclusie<\/h2><\/div><div class=\"fusion-text fusion-text-12\"><p>In dit artikel onderzochten we de opkomst van LLMOps, een afstammeling van DevOps en MLOps, speciaal ontworpen om de operationele uitdagingen van grote taalmodellen aan te pakken. Laten we afsluiten met een visuele vergelijking van deze drie methodologie\u00ebn, waarbij we hun reikwijdte illustreren binnen de context van bedrijven die gebruik maken van LLM, die deze modellen gebruiken om producten te maken en bedrijfsproblemen op te lossen.<\/p>\n<\/div><div class=\"fusion-image-element\" style=\"text-align:center;--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-2 hover-type-none\"><img decoding=\"async\" width=\"927\" height=\"727\" title=\"sch\u00e9ma 2 LLMOps medium artikel\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-2-LLMOps-medium-article.png\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-2-LLMOps-medium-article.png\" alt class=\"lazyload img-responsive wp-image-95613\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%27927%27%20height%3D%27727%27%20viewBox%3D%270%200%20927%20727%27%3E%3Crect%20width%3D%27927%27%20height%3D%27727%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-2-LLMOps-medium-article-200x157.png 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-2-LLMOps-medium-article-400x314.png 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-2-LLMOps-medium-article-600x471.png 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-2-LLMOps-medium-article-800x627.png 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2024\/02\/schema-2-LLMOps-medium-article.png 927w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 927px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-13\"><p>Hoewel de drie methodologie\u00ebn gemeenschappelijke praktijken delen, zoals CI\/CD, versiebeheer en evaluatie, hebben ze elk hun eigen aandachtsgebieden. DevOps omvat de volledige levenscyclus van softwareontwikkeling, van ontwikkeling tot implementatie en onderhoud. MLOps breidt DevOps uit om de specifieke uitdagingen van machine-learningmodellen aan te pakken, waaronder het automatiseren van modeltraining, implementatie en bewaking. LLMOps, de nieuwste iteratie van deze methodologie\u00ebn, richt zich specifiek op LLM's. Hoewel LLM-gebruikers hun eigen modellen niet hoeven te ontwikkelen, hebben ze nog steeds te maken met operationele uitdagingen, zoals het beheren van API-wijzigingen en het aanpassen van modellen via technieken zoals prompt engineering en fine-tuning.<\/p>\n<\/div><\/div><\/div><\/div><\/article><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-3 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-margin-top:40px;--awb-margin-bottom:40px;--awb-background-color:var(--awb-color1);--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-center fusion-flex-justify-content-center fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-2 fusion_builder_column_1_1 1_1 fusion-flex-column fusion-flex-align-self-center\" style=\"--awb-padding-top:40px;--awb-padding-right:40px;--awb-padding-bottom:40px;--awb-padding-left:40px;--awb-overflow:hidden;--awb-bg-position:left center;--awb-bg-size:cover;--awb-border-color:rgba(10,17,40,0.1);--awb-border-style:solid;--awb-border-radius:4px 4px 4px 4px;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\"><div class=\"fusion-column-wrapper lazyload fusion-column-has-shadow fusion-flex-justify-content-center fusion-content-layout-column fusion-column-has-bg-image\" data-bg-url=\"https:\/\/artefact.com\/\/wp-content\/uploads\/2021\/03\/background.jpg\" data-bg=\"https:\/\/artefact.com\/\/wp-content\/uploads\/2021\/03\/background.jpg\"><div class=\"fusion-image-element\" style=\"text-align:center;--awb-margin-right:20px;--awb-margin-left:20px;--awb-max-width:150px;--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-3 hover-type-none\"><img decoding=\"async\" width=\"72\" height=\"41\" title=\"middelgrote\" src=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%2772%27%20height%3D%2741%27%20viewBox%3D%270%200%2072%2041%27%3E%3Crect%20width%3D%2772%27%20height%3D%2741%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-orig-src=\"https:\/\/artefact.com\/\/wp-content\/uploads\/2021\/03\/medium.png\" alt class=\"lazyload img-responsive wp-image-60927\"\/><\/span><\/div><div class=\"fusion-title title fusion-title-11 fusion-sep-none fusion-title-center fusion-title-text fusion-title-size-three\" style=\"--awb-margin-top:20px;--awb-margin-bottom:0px;--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-center fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Medium Blog bij Artefact.<\/h3><\/div><div class=\"fusion-text fusion-text-14\" style=\"--awb-content-alignment:center;\"><p>Dit artikel werd oorspronkelijk gepubliceerd op Medium.com.<br \/>\nVolg ons op ons medium Blog !<\/p>\n<\/div><div style=\"text-align:center;\"><a class=\"fusion-button button-flat button-medium button-default fusion-button-default button-1 fusion-button-default-span fusion-button-default-type\" style=\"--button_text_transform:var(--awb-custom_typography_2-text-transform);--button_typography-letter-spacing:var(--awb-custom_typography_2-letter-spacing);--button_typography-font-family:var(--awb-custom_typography_2-font-family);--button_typography-font-weight:var(--awb-custom_typography_2-font-weight);--button_typography-font-style:var(--awb-custom_typography_2-font-style);\" target=\"_blank\" rel=\"noopener noreferrer\" data-hover=\"text_slide_down\" href=\"https:\/\/medium.com\/artefact-engineering-and-data-science\/why-you-need-llmops-48c0925827de#c82e-c015f09e2d46\"><div class=\"awb-button-text-transition  awb-button__hover-content--centered\"><span class=\"fusion-button-text awb-button__text awb-button__text--default\">Lees ons artikel<\/span><span class=\"fusion-button-text awb-button__text awb-button__text--default\">Lees ons artikel<\/span><\/div><\/a><\/div><\/div><\/div><\/div><\/div><\/p>","protected":false},"excerpt":{"rendered":"<p>In dit artikel wordt LLMOps ge\u00efntroduceerd, een gespecialiseerde tak die DevOps en MLOps samenbrengt om de uitdagingen aan te gaan die grote taalmodellen (LLM\u2019s) met zich meebrengen\u2026<\/p>","protected":false},"featured_media":95614,"parent":0,"template":"","meta":{"_acf_changed":false,"ep_exclude_from_search":false},"blog-category":[2995,21939],"blog-language":[2991],"class_list":["post-95610","blog","type-blog","status-publish","has-post-thumbnail","hentry","blog-category-ai-technology","blog-category-medium","blog-language-en"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.artefact.com\/nl\/wp-json\/wp\/v2\/blog\/95610","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.artefact.com\/nl\/wp-json\/wp\/v2\/blog"}],"about":[{"href":"https:\/\/www.artefact.com\/nl\/wp-json\/wp\/v2\/types\/blog"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.artefact.com\/nl\/wp-json\/wp\/v2\/media\/95614"}],"wp:attachment":[{"href":"https:\/\/www.artefact.com\/nl\/wp-json\/wp\/v2\/media?parent=95610"}],"wp:term":[{"taxonomy":"blog-category","embeddable":true,"href":"https:\/\/www.artefact.com\/nl\/wp-json\/wp\/v2\/blog-category?post=95610"},{"taxonomy":"blog-language","embeddable":true,"href":"https:\/\/www.artefact.com\/nl\/wp-json\/wp\/v2\/blog-language?post=95610"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}