	{"id":71574,"date":"2023-07-05T14:29:02","date_gmt":"2023-07-05T13:29:02","guid":{"rendered":"https:\/\/www.artefact.com\/?post_type=blog&#038;p=71574"},"modified":"2024-09-20T17:45:57","modified_gmt":"2024-09-20T16:45:57","slug":"encoding-categorical-features-in-forecasting-are-we-all-doing-it-wrong","status":"publish","type":"blog","link":"https:\/\/www.artefact.com\/fr\/blog\/encoding-categorical-features-in-forecasting-are-we-all-doing-it-wrong\/","title":{"rendered":"Encodage des caract\u00e9ristiques cat\u00e9gorielles dans les pr\u00e9visions : sommes-nous tous dans l'erreur ?"},"content":{"rendered":"<p><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-1 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling article-author\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-background-color:#ffffff;--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-0 fusion_builder_column_1_2 1_2 fusion-flex-column\" style=\"--awb-bg-size:cover;--awb-width-large:50%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:50%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\"><div class=\"fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column\"><div class=\"fusion-title title fusion-title-1 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Auteur<\/h2><\/div><img decoding=\"async\" src=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%27150%27%20height%3D%270%27%20viewBox%3D%270%200%20150%200%27%3E%3Crect%20width%3D%27150%27%20height%3D%270%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Youssef-Oudghiri.jpeg\" alt=\"Image\" class=\"lazyload artefact-elegant-image align-left article-author-image\" style=\"width: 150px; border-radius: 54% 46% 77% 23% \/ 74% 40% 60% 26%; overflow: hidden;\" width=\"150\" height=\"auto\" \/><div class=\"fusion-title title fusion-title-2 fusion-sep-none fusion-title-text fusion-title-size-three article-author-name-title\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Youssef Oudghiri<\/h3><\/div><div class=\"fusion-text fusion-text-1 article-author-description\" style=\"--awb-text-transform:none;\"><p>Data Scientifique \u00e0 Artefact France<\/p>\n<\/div><\/div><\/div><\/div><\/div><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-2 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-1 fusion_builder_column_1_1 1_1 fusion-flex-column\" style=\"--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\"><div class=\"fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column\"><div class=\"fusion-text fusion-text-2 description\"><p>Nous proposons une nouvelle m\u00e9thode d'encodage des caract\u00e9ristiques cat\u00e9gorielles sp\u00e9cifiquement adapt\u00e9e aux applications de pr\u00e9vision. Essentiellement, cette approche code les caract\u00e9ristiques cat\u00e9gorielles en mod\u00e9lisant la tendance des quantit\u00e9s associ\u00e9es \u00e0 chaque cat\u00e9gorie. Dans nos exp\u00e9riences, cette approche pr\u00e9sente des avantages substantiels en termes de performance - \u00e0 la fois en termes de pr\u00e9cision et de biais de pr\u00e9vision - car elle permet aux mod\u00e8les d'ensemble bas\u00e9s sur des arbres de mieux mod\u00e9liser et extrapoler les tendances.<\/p>\n<\/div><\/div><\/div><\/div><\/div><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-3 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-margin-top:40px;--awb-margin-bottom:40px;--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-center fusion-flex-justify-content-center fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-2 fusion_builder_column_1_1 1_1 fusion-flex-column fusion-flex-align-self-center fusion-column-inner-bg-wrapper\" style=\"--awb-padding-top:20px;--awb-padding-right:20px;--awb-padding-bottom:20px;--awb-padding-left:20px;--awb-overflow:hidden;--awb-inner-bg-size:cover;--awb-border-color:rgba(10,17,40,0.1);--awb-border-top:1px;--awb-border-right:1px;--awb-border-bottom:1px;--awb-border-left:1px;--awb-border-style:solid;--awb-border-radius:4px 4px 4px 4px;--awb-inner-bg-border-radius:4px 4px 4px 4px;--awb-inner-bg-overflow:hidden;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\" data-scroll-devices=\"small-visibility,medium-visibility,large-visibility\"><span class=\"fusion-column-inner-bg hover-type-none\"><a class=\"fusion-column-anchor\" href=\"https:\/\/medium.com\/artefact-engineering-and-data-science\/encoding-categorical-features-in-forecasting-are-we-all-doing-it-wrong-fe8a9a6488da\" rel=\"noopener noreferrer\" target=\"_blank\"><span class=\"fusion-column-inner-bg-image\"><\/span><\/a><\/span><div class=\"fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-center fusion-content-layout-row fusion-flex-align-items-center\"><div class=\"fusion-text fusion-text-3\"><p><u>Lisez notre article sur<\/u><\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-margin-right:20px;--awb-margin-left:20px;--awb-max-width:150px;--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-1 hover-type-none\"><img decoding=\"async\" width=\"4000\" height=\"992\" title=\"Moyen Blog\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog.png\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog.png\" alt class=\"lazyload img-responsive wp-image-60582\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%274000%27%20height%3D%27992%27%20viewBox%3D%270%200%204000%20992%27%3E%3Crect%20width%3D%274000%27%20height%3D%27992%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog-200x50.png 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog-400x99.png 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog-600x149.png 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog-800x198.png 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog-1200x298.png 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog.png 4000w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 4000px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-4\"><p>.<\/p>\n<\/div><\/div><\/div><\/div><\/div><article class=\"fusion-fullwidth fullwidth-box fusion-builder-row-4 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-justify-content-center fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-3 fusion_builder_column_1_1 1_1 fusion-flex-column\" style=\"--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\"><div class=\"fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column\"><div class=\"fusion-title title fusion-title-3 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Introduction<\/h2><\/div><div class=\"fusion-text fusion-text-5\"><p>Ce travail a \u00e9t\u00e9 motiv\u00e9 par de nombreux projets de pr\u00e9vision de clients au Artefact, pour lesquels nos mod\u00e8les de stimulation pr\u00e9sentaient un biais important au moment de la pr\u00e9diction. Gr\u00e2ce \u00e0 une phase de diagnostic, nous avons identifi\u00e9 que l'une des principales sources de biais dans les mod\u00e8les d'apprentissage ensembliste provenait de leurs difficult\u00e9s \u00e0 mod\u00e9liser avec pr\u00e9cision les tendances et les niveaux fluctuants.<\/p>\n<p>Dans ce qui suit, nous allons d\u00e9montrer\u00a0<strong>pourquoi<\/strong>\u00a0et\u00a0<strong>comment<\/strong>\u00a0nous avons utilis\u00e9 une nouvelle approche pour coder les caract\u00e9ristiques cat\u00e9gorielles. Sur la base de nos exp\u00e9riences portant sur un projet de pr\u00e9vision du commerce de d\u00e9tail d'un client et sur divers ensembles data publics, nous prouvons que cette technique peut effectivement att\u00e9nuer les biais et am\u00e9liorer la pr\u00e9cision.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-4 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Boosting et tendances, pourquoi est-ce complexe ?<\/h2><\/div><div class=\"fusion-title title fusion-title-5 fusion-sep-none fusion-title-text fusion-title-size-three\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Les algorithmes de boosting ont du mal \u00e0 extrapoler<\/h3><\/div><div class=\"fusion-text fusion-text-6\"><p>Les algorithmes de boosting ont du mal \u00e0 mod\u00e9liser et \u00e0 extrapoler les tendances car ils ne peuvent pas pr\u00e9dire de nouvelles valeurs qui n'ont pas \u00e9t\u00e9 observ\u00e9es dans l'ensemble d'apprentissage ou qui sont absentes des feuilles. \u201c<a href=\"https:\/\/pypi.org\/project\/linear-tree\/\" target=\"_blank\" rel=\"noopener ugc nofollow\">Arbre lin\u00e9aire<\/a>\u201dLes mod\u00e8les de la s\u00e9rie \" C \" tentent de rem\u00e9dier \u00e0 ce probl\u00e8me, mais nos tests n'ont pas donn\u00e9 de r\u00e9sultats concluants avec cette m\u00e9thode.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-6 fusion-sep-none fusion-title-text fusion-title-size-three\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Les encodages classiques favorisent les pr\u00e9dictions statiques<\/h3><\/div><div class=\"fusion-text fusion-text-7\"><p>Les m\u00e9thodes d'encodage les plus courantes employ\u00e9es dans le cadre du boosting favorisent les relations statiques entre les variables ind\u00e9pendantes et d\u00e9pendantes, ce qui contribue \u00e0 accro\u00eetre le biais en pr\u00e9sence de tendances. Le diagramme ci-dessous illustre ce ph\u00e9nom\u00e8ne :<\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-2 hover-type-none\"><img decoding=\"async\" width=\"1400\" height=\"477\" alt=\"Classical encodings push towards static predictions\" title=\"Les encodages classiques favorisent les pr\u00e9dictions statiques\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions.webp\" class=\"lazyload img-responsive wp-image-71578\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271400%27%20height%3D%27477%27%20viewBox%3D%270%200%201400%20477%27%3E%3Crect%20width%3D%271400%27%20height%3D%27477%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions-200x68.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions-400x136.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions-600x204.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions-800x273.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions-1200x409.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions.webp 1400w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1400px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-8\"><p style=\"text-align: center;\"><em>Repr\u00e9sentation visuelle simplifi\u00e9e mettant en \u00e9vidence la nature statique de l'encodage des caract\u00e9ristiques cat\u00e9gorielles utilis\u00e9 dans les algorithmes de boosting.<\/em><\/p>\n<\/div><div class=\"fusion-text fusion-text-9\"><p>Nous reconnaissons que la repr\u00e9sentation ci-dessus est une simplification excessive, car les arbres de d\u00e9cision sont plus complexes et capables d'identifier des relations non lin\u00e9aires bas\u00e9es sur des facteurs multiples. En effet, la condition \u201cla couleur est noire\u201d pourrait \u00eatre associ\u00e9e \u00e0 \u201cle mois de juin\u201d. Dans ce cas, la couleur noire n'aurait pas le m\u00eame impact \u00e0 tout moment. Mais regardons la situation dans son ensemble :<\/p>\n<\/div><ul style=\"--awb-line-height:27.2px;--awb-icon-width:27.2px;--awb-icon-height:27.2px;--awb-icon-margin:11.2px;--awb-content-margin:38.4px;\" class=\"fusion-checklist fusion-checklist-1 fusion-checklist-default type-icons paddingList dark-text\"><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p>L'attribution d'un impact unique pour la couleur noire en juin n'est toujours pas id\u00e9ale, car l'impact en juin 2021 peut diff\u00e9rer de l'impact en juin 2022. M\u00eame si nous incluons l'ann\u00e9e, la fronti\u00e8re d\u00e9cisionnelle deviendrait trop complexe \u00e0 construire et \u00e0 identifier, mais aussi, que se passerait-il si la formation data se termine en 2022 et que des pr\u00e9visions doivent \u00eatre faites pour 2023 ?<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p><strong>L'ing\u00e9nierie des caract\u00e9ristiques a pour but d'aider le mod\u00e8le \u00e0 identifier plus facilement les relations.<\/strong>. Si nous pouvons aider le mod\u00e8le \u00e0 associer l'impact de la couleur noire \u00e0 un moment donn\u00e9 sans n\u00e9cessiter l'identification de relations complexes, cela serait tr\u00e8s avantageux pour le mod\u00e8le. D'o\u00f9 ...<\/p>\n<\/div><\/li><\/ul><div class=\"fusion-title title fusion-title-7 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Notre nouvelle approche : Encodage dynamique des caract\u00e9ristiques cat\u00e9gorielles<\/h2><\/div><div class=\"fusion-title title fusion-title-8 fusion-sep-none fusion-title-text fusion-title-size-three\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Base de l'encodage dynamique (v1 sans niveau de l'item)<\/h3><\/div><div class=\"fusion-text fusion-text-10\"><p id=\"9918\" data-selectable-paragraph=\"\">En une phrase, notre m\u00e9thode d'encodage des caract\u00e9ristiques cat\u00e9gorielles pourrait \u00eatre d\u00e9crite comme suit :\u00a0<strong>nous mod\u00e9lisons la composante tendancielle de chaque cat\u00e9gorie et utilisons ces valeurs tendancielles pour coder cette caract\u00e9ristique cat\u00e9gorielle<\/strong>.<\/p>\n<p id=\"b6c1\" data-selectable-paragraph=\"\">Le diagramme ci-dessous illustre la diff\u00e9rence entre l'encodage d'une moyenne statique et l'encodage d'une tendance pour deux cat\u00e9gories de couleurs : le noir et l'or.<\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-3 hover-type-none\"><img decoding=\"async\" width=\"1400\" height=\"453\" title=\"Base de l&#039;encodage dynamique (v1 sans niveau de l&#039;item)\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level.webp\" alt class=\"lazyload img-responsive wp-image-71579\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271400%27%20height%3D%27453%27%20viewBox%3D%270%200%201400%20453%27%3E%3Crect%20width%3D%271400%27%20height%3D%27453%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level-200x65.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level-400x129.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level-600x194.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level-800x259.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level-1200x388.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level.webp 1400w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1400px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-11\"><p style=\"text-align: center;\"><em>Illustration du principe d'encodage dynamique, qui implique une mod\u00e9lisation des tendances pour chaque cat\u00e9gorie.<\/em><\/p>\n<\/div><div class=\"fusion-text fusion-text-12\"><p id=\"8d62\" data-selectable-paragraph=\"\">Dans nos exp\u00e9riences, nous avons choisi d'utiliser Prophet pour extraire la composante de tendance. Naturellement, il est possible d'envisager d'autres mod\u00e8les de pr\u00e9vision des s\u00e9ries temporelles.<\/p>\n<p id=\"5677\" data-selectable-paragraph=\"\">Notez que l'encodage moyen statique implique que les ventes d'articles noirs se situent \u00e0 un niveau moyen de 100 unit\u00e9s\/mois \u00e0 tout moment. L'encodage dynamique, en revanche, permet de tenir compte de la tendance \u00e0 la hausse observ\u00e9e pour les articles noirs et de l'extrapoler \u00e0 l'avenir. Une constatation similaire peut \u00eatre faite en ce qui concerne les articles en or. Ainsi, notre approche sera particuli\u00e8rement utile dans les ensembles data o\u00f9 la variable cible \u00e0 pr\u00e9voir suit des tendances marqu\u00e9es dans les diff\u00e9rentes cat\u00e9gories disponibles.<\/p>\n<p id=\"096b\" data-selectable-paragraph=\"\">Notre objectif principal est de permettre au mod\u00e8le de s'adapter plus facilement aux relations changeantes entre les variables ind\u00e9pendantes et la variable d\u00e9pendante \u00e0 pr\u00e9voir. Par cons\u00e9quent, cette m\u00e9thode d'encodage dynamique pourrait \u00e9galement \u00eatre appliqu\u00e9e aux caract\u00e9ristiques num\u00e9riques. Prenons l'exemple du prix. Bien que le prix soit num\u00e9rique et que le mod\u00e8le puisse directement construire des r\u00e8gles bas\u00e9es sur lui, les pr\u00e9f\u00e9rences des gens pour les articles bon march\u00e9 ou co\u00fbteux peuvent encore \u00e9voluer dans le temps et suivre une tendance de vente sp\u00e9cifique. Dans le contexte d'une crise \u00e9conomique, par exemple, les produits abordables peuvent suivre une tendance \u00e0 la hausse des ventes, tandis que les produits on\u00e9reux peuvent suivre une tendance \u00e0 la baisse. En consid\u00e9rant \u2018abordable\u2019 comme une cat\u00e9gorie et \u2018cher\u2019 comme une autre, nous pourrions proposer un codage dynamique pour la caract\u00e9ristique du prix, comme nous l'avons fait pour les couleurs.<\/p>\n<p id=\"671a\" data-selectable-paragraph=\"\">Il est important de noter que pour les caract\u00e9ristiques num\u00e9riques, les variables de base et les variables cod\u00e9es dynamiquement peuvent \u00eatre utilis\u00e9es dans le mod\u00e8le, car elles fourniront diff\u00e9rents types d'informations.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-9 fusion-sep-none fusion-title-text fusion-title-size-three\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Accorder plus d'importance aux caract\u00e9ristiques dynamiques (v2 avec niveau de l'\u00e9l\u00e9ment)<\/h3><\/div><div class=\"fusion-text fusion-text-13\"><p id=\"c481\" data-selectable-paragraph=\"\">Bien que cette nouvelle m\u00e9thode d'encodage constitue une am\u00e9lioration, il arrive souvent que l'importance des caract\u00e9ristiques cat\u00e9gorielles ne soit pas suffisamment \u00e9lev\u00e9e pour avoir un impact significatif sur les pr\u00e9dictions lors de l'examen de l'importance des caract\u00e9ristiques. Pour donner plus d'importance aux caract\u00e9ristiques dynamiques et favoriser ainsi une meilleure mod\u00e9lisation et extrapolation des tendances, nous adaptons les valeurs d'encodage \u00e0 chaque s\u00e9rie chronologique\/\u00e9l\u00e9ment individuellement.<\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-4 hover-type-none\"><img decoding=\"async\" width=\"1342\" height=\"296\" title=\"Accorder plus d&#039;importance aux caract\u00e9ristiques dynamiques (v2 avec niveau de l&#039;\u00e9l\u00e9ment)\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level.webp\" alt class=\"lazyload img-responsive wp-image-71580\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271342%27%20height%3D%27296%27%20viewBox%3D%270%200%201342%20296%27%3E%3Crect%20width%3D%271342%27%20height%3D%27296%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level-200x44.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level-400x88.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level-600x132.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level-800x176.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level-1200x265.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level.webp 1342w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1342px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-14\"><p style=\"text-align: center;\"><em>Formule repr\u00e9sentant les deux composantes de l'encodage dynamique : le niveau de la cat\u00e9gorie et le niveau de l'\u00e9l\u00e9ment.<\/em><\/p>\n<\/div><div class=\"fusion-text fusion-text-15\"><p>Pour en revenir \u00e0 notre exemple de couleur, si l'on consid\u00e8re deux articles noirs diff\u00e9rents, l'encodage dynamique de la cat\u00e9gorie \u201cnoir\u201d pour chaque article peut \u00eatre diff\u00e9rent en fonction de ses ventes ant\u00e9rieures.<\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-5 hover-type-none\"><img decoding=\"async\" width=\"1400\" height=\"587\" alt=\"Table illustrating the calculation of dynamic encoding through a simple example\" title=\"Tableau illustrant le calcul de l&#039;encodage dynamique \u00e0 travers un exemple simple\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example.webp\" class=\"lazyload img-responsive wp-image-71582\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271400%27%20height%3D%27587%27%20viewBox%3D%270%200%201400%20587%27%3E%3Crect%20width%3D%271400%27%20height%3D%27587%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example-200x84.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example-400x168.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example-600x252.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example-800x335.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example-1200x503.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example.webp 1400w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1400px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-16\"><p style=\"text-align: center;\"><em>Tableau illustrant le calcul de l'encodage dynamique \u00e0 travers un exemple simple<\/em><\/p>\n<\/div><div class=\"fusion-title title fusion-title-10 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Exp\u00e9riences et r\u00e9sultats<\/h2><\/div><div class=\"fusion-title title fusion-title-11 fusion-sep-none fusion-title-text fusion-title-size-three\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Client dataset<\/h3><\/div><div class=\"fusion-text fusion-text-17\"><p>Nous avons utilis\u00e9 notre approche pour pr\u00e9voir les ventes de l'un de nos clients du secteur de la vente au d\u00e9tail. Nous avons valid\u00e9 notre m\u00e9thode sur un large \u00e9ventail de champs d'application afin de nous assurer de son efficacit\u00e9. Voici quelques points data concernant le contexte exp\u00e9rimental :<\/p>\n<\/div><ul style=\"--awb-line-height:27.2px;--awb-icon-width:27.2px;--awb-icon-height:27.2px;--awb-icon-margin:11.2px;--awb-content-margin:38.4px;\" class=\"fusion-checklist fusion-checklist-2 fusion-checklist-default type-icons paddingList dark-text\"><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">Les exp\u00e9riences ont \u00e9t\u00e9 men\u00e9es sur 9 produits diff\u00e9rents, avec un mod\u00e8le de renforcement (LightGBM) pour chaque produit.<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p>Pour chaque champ d'application, une validation crois\u00e9e k-fold avec une fen\u00eatre d'expansion a \u00e9t\u00e9 r\u00e9alis\u00e9e (k=5).<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p>Horizon de pr\u00e9vision : Du jour+1 au jour+180.<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p>Les performances ont \u00e9t\u00e9 \u00e9valu\u00e9es \u00e0 l'aide de deux param\u00e8tres :<\/p>\n<\/div><\/li><\/ul><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-6 hover-type-none\"><img decoding=\"async\" width=\"1400\" height=\"329\" title=\"formule\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule.webp\" alt class=\"lazyload img-responsive wp-image-71583\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271400%27%20height%3D%27329%27%20viewBox%3D%270%200%201400%20329%27%3E%3Crect%20width%3D%271400%27%20height%3D%27329%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule-200x47.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule-400x94.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule-600x141.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule-800x188.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule-1200x282.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule.webp 1400w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1400px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-18\"><p id=\"3f94\" data-selectable-paragraph=\"\">Dans l'ensemble, la m\u00e9thode s'est av\u00e9r\u00e9e tr\u00e8s efficace,\u00a0<strong>ce qui se traduit par une diminution absolue moyenne du biais de 9,82% et une augmentation absolue moyenne de la pr\u00e9cision des pr\u00e9visions de 6,29%.<\/strong>\u00a0sur les 9 champs d'application des produits et les 5 plis de validation crois\u00e9e.<\/p>\n<p id=\"22b9\" data-selectable-paragraph=\"\">La section suivante valide la pertinence de notre m\u00e9thode en la testant sur un ensemble public de data.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-12 fusion-sep-none fusion-title-text fusion-title-size-three\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Ventes dans les magasins publics dataset<\/h3><\/div><div class=\"fusion-text fusion-text-19\"><p id=\"fbf3\" data-selectable-paragraph=\"\">Dans cette \u00e9tude de cas simplifi\u00e9e, nous utilisons le\u00a0<a href=\"https:\/\/www.kaggle.com\/competitions\/store-sales-time-series-forecasting\/data\" target=\"_blank\" rel=\"noopener ugc nofollow\">Ventes en magasin - Pr\u00e9vision des s\u00e9ries temporelles<\/a>\u00a0Kaggle dataset. Cet ensemble dataset pr\u00e9sente une tendance abrupte lorsque l'on examine la s\u00e9rie chronologique des ventes moyennes, ce qui rend notre m\u00e9thode particuli\u00e8rement pertinente. En outre, l'horizon de pr\u00e9diction choisi est de trois mois, ce qui est suffisamment \u00e9loign\u00e9 pour b\u00e9n\u00e9ficier des capacit\u00e9s d'extrapolation de l'encodage dynamique. \u00c0 des fins de d\u00e9monstration, nous limitons le dataset au 31 mars 2016, juste avant qu'un tremblement de terre ne se produise, entra\u00eenant l'aplatissement de la courbe des ventes.<\/p>\n<p id=\"2ec5\" data-selectable-paragraph=\"\">Avant tout encodage, notre ensemble initial de data comprend environ 75% de caract\u00e9ristiques num\u00e9riques, englobant les d\u00e9calages, les moyennes mobiles, les caract\u00e9ristiques du calendrier et les \u00e9v\u00e9nements de vacances. Les 25% restantes sont des attributs cat\u00e9goriels tels que la famille de produits, le num\u00e9ro de magasin, la ville, etc.<\/p>\n<p id=\"f0f6\" data-selectable-paragraph=\"\">Deux mod\u00e8les distincts sont form\u00e9s : l'un utilise les caract\u00e9ristiques cat\u00e9gorielles encod\u00e9es dynamiquement \u00e0 l'aide de notre m\u00e9thode personnalis\u00e9e, tandis que l'autre utilise le traitement natif des caract\u00e9ristiques cat\u00e9gorielles de LightGBM.<\/p>\n<p id=\"f3b9\" data-selectable-paragraph=\"\">En comparant leurs performances, nous observons une am\u00e9lioration significative de l'approche d'encodage dynamique. Le tableau suivant fournit un r\u00e9sum\u00e9 des r\u00e9sultats :<\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-7 hover-type-none\"><img decoding=\"async\" width=\"1222\" height=\"272\" alt=\"Comparison of RMSE, FA, and %Bias between LightGBM encoding method and dynamic encoding\" title=\"image 1\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1.webp\" class=\"lazyload img-responsive wp-image-71584\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271222%27%20height%3D%27272%27%20viewBox%3D%270%200%201222%20272%27%3E%3Crect%20width%3D%271222%27%20height%3D%27272%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1-200x45.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1-400x89.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1-600x134.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1-800x178.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1-1200x267.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1.webp 1222w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1222px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-20\"><p style=\"text-align: center;\"><em>Comparaison des valeurs RMSE, FA et %Bias entre la m\u00e9thode d'encodage LightGBM et l'encodage dynamique<\/em><\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-8 hover-type-none\"><img decoding=\"async\" width=\"1304\" height=\"364\" title=\"image2\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2.webp\" alt class=\"lazyload img-responsive wp-image-71585\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271304%27%20height%3D%27364%27%20viewBox%3D%270%200%201304%20364%27%3E%3Crect%20width%3D%271304%27%20height%3D%27364%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2-200x56.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2-400x112.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2-600x167.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2-800x223.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2-1200x335.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2.webp 1304w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1304px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-21\"><p style=\"text-align: center;\"><em>Ventes hebdomadaires moyennes + pr\u00e9visions sur 3 mois (encodage dynamique vs m\u00e9thode d'encodage LightGBM)<br \/>\n<\/em><\/p>\n<\/div><div class=\"fusion-text fusion-text-22\"><p>Comme le montre le graphique ci-dessus, le mod\u00e8le incorporant des encodages dynamiques\u00a0<strong>saisit efficacement la tendance et l'extrapole<\/strong>, alors que le mod\u00e8le alternatif peine \u00e0 y parvenir.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-13 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Utilisation et limites<\/h2><\/div><div class=\"fusion-text fusion-text-23\"><p>Notre m\u00e9thode s'av\u00e8re particuli\u00e8rement utile dans les sc\u00e9narios o\u00f9 la s\u00e9rie temporelle affiche\u00a0<strong>tendances prononc\u00e9es<\/strong>\u00a0et le\u00a0<strong>horizon de pr\u00e9vision<\/strong> est suffisamment \u00e9loign\u00e9e pour b\u00e9n\u00e9ficier de l'extrapolation des tendances. En outre, \u00e0 mesure que nous encodons et incorporons de mani\u00e8re dynamique les donn\u00e9es des\u00a0<strong>des caract\u00e9ristiques plus cat\u00e9goriques<\/strong>\u00a0avec\u00a0<strong>pr\u00e9dictif significatif<\/strong> <strong>pouvoir<\/strong>\u00a0dans le mod\u00e8le,\u00a0<strong>l'effet obtenu par notre approche sur les augmentations des pr\u00e9dictions<\/strong>. Cependant, il est important de reconna\u00eetre que d'autres m\u00e9thodes d'encodage ont leurs propres avantages et peuvent \u00eatre plus avantageuses dans diff\u00e9rents contextes. En outre, il est possible de combiner les deux types d'encodage pour obtenir des r\u00e9sultats potentiellement meilleurs.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-14 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Conclusion<\/h2><\/div><ul style=\"--awb-line-height:27.2px;--awb-icon-width:27.2px;--awb-icon-height:27.2px;--awb-icon-margin:11.2px;--awb-content-margin:38.4px;\" class=\"fusion-checklist fusion-checklist-3 fusion-checklist-default type-icons paddingList dark-text\"><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">Les techniques d'encodage conventionnelles pour les caract\u00e9ristiques cat\u00e9gorielles ne sont pas id\u00e9ales pour les pr\u00e9visions, en particulier lorsque les s\u00e9ries temporelles pr\u00e9sentent des tendances marqu\u00e9es et que l'horizon de pr\u00e9vision est lointain.<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p>Notre m\u00e9thode est une variante de l'empilement de mod\u00e8les, puisque nous utilisons un mod\u00e8le Proph\u00e8te - qui poss\u00e8de des capacit\u00e9s sup\u00e9rieures de mod\u00e9lisation et d'extrapolation des tendances - pour construire l'encodage des caract\u00e9ristiques cat\u00e9gorielles.<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p>Nos exp\u00e9riences ont d\u00e9montr\u00e9 les avantages de la r\u00e9duction des biais et de l'am\u00e9lioration de la pr\u00e9cision des pr\u00e9visions.<\/p>\n<\/div><\/li><\/ul><div class=\"fusion-text fusion-text-24\"><p>Nous pr\u00e9voyons de publier un document dans les mois \u00e0 venir, qui contiendra tous les d\u00e9tails de notre approche et de notre mise en \u0153uvre. <a href=\"https:\/\/www.artefact.com\/fr\/blog\/\">Restez \u00e0 l'\u00e9coute<\/a> pour des mises \u00e0 jour ult\u00e9rieures !<\/p>\n<\/div><\/div><\/div><\/div><\/article><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-5 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-margin-top:40px;--awb-margin-bottom:40px;--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-center fusion-flex-justify-content-center fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-4 fusion_builder_column_1_1 1_1 fusion-flex-column fusion-flex-align-self-center\" style=\"--awb-padding-top:40px;--awb-padding-right:40px;--awb-padding-bottom:40px;--awb-padding-left:40px;--awb-overflow:hidden;--awb-bg-position:left center;--awb-bg-size:cover;--awb-border-color:rgba(10,17,40,0.1);--awb-border-style:solid;--awb-border-radius:4px 4px 4px 4px;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\"><div class=\"fusion-column-wrapper lazyload fusion-column-has-shadow fusion-flex-justify-content-center fusion-content-layout-column fusion-column-has-bg-image\" data-bg-url=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/03\/background.jpg\" data-bg=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/03\/background.jpg\"><div class=\"fusion-image-element\" style=\"text-align:center;--awb-margin-right:20px;--awb-margin-left:20px;--awb-max-width:150px;--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-9 hover-type-none\"><img decoding=\"async\" width=\"72\" height=\"41\" title=\"moyen\" src=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%2772%27%20height%3D%2741%27%20viewBox%3D%270%200%2072%2041%27%3E%3Crect%20width%3D%2772%27%20height%3D%2741%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/03\/medium.png\" alt class=\"lazyload img-responsive wp-image-60927\"\/><\/span><\/div><div class=\"fusion-title title fusion-title-15 fusion-sep-none fusion-title-center fusion-title-text fusion-title-size-three\" style=\"--awb-margin-top:20px;--awb-margin-bottom:0px;--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-center fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Moyen Blog par Artefact.<\/h3><\/div><div class=\"fusion-text fusion-text-25\" style=\"--awb-content-alignment:center;\"><p>Cet article a \u00e9t\u00e9 initialement publi\u00e9 sur <strong>Medium.com<\/strong>.<br \/>\nSuivez-nous sur notre Medium Blog !<\/p>\n<\/div><div style=\"text-align:center;\"><a class=\"fusion-button button-flat button-medium button-default fusion-button-default button-1 fusion-button-default-span fusion-button-default-type\" target=\"_blank\" rel=\"noopener noreferrer\" href=\"https:\/\/medium.com\/artefact-engineering-and-data-science\/encoding-categorical-features-in-forecasting-are-we-all-doing-it-wrong-fe8a9a6488da\"><span class=\"fusion-button-text awb-button__text awb-button__text--default\">Lire notre article<\/span><\/a><\/div><\/div><\/div><\/div><\/div><\/p>","protected":false},"excerpt":{"rendered":"<p>Nous proposons une nouvelle m\u00e9thode d'encodage des caract\u00e9ristiques cat\u00e9gorielles sp\u00e9cifiquement adapt\u00e9e aux applications de pr\u00e9vision.<\/p>","protected":false},"featured_media":71575,"parent":0,"template":"","meta":{"_acf_changed":false,"ep_exclude_from_search":false},"blog-category":[21939],"blog-language":[2991],"class_list":["post-71574","blog","type-blog","status-publish","has-post-thumbnail","hentry","blog-category-medium","blog-language-en"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.artefact.com\/fr\/wp-json\/wp\/v2\/blog\/71574","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.artefact.com\/fr\/wp-json\/wp\/v2\/blog"}],"about":[{"href":"https:\/\/www.artefact.com\/fr\/wp-json\/wp\/v2\/types\/blog"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.artefact.com\/fr\/wp-json\/wp\/v2\/media\/71575"}],"wp:attachment":[{"href":"https:\/\/www.artefact.com\/fr\/wp-json\/wp\/v2\/media?parent=71574"}],"wp:term":[{"taxonomy":"blog-category","embeddable":true,"href":"https:\/\/www.artefact.com\/fr\/wp-json\/wp\/v2\/blog-category?post=71574"},{"taxonomy":"blog-language","embeddable":true,"href":"https:\/\/www.artefact.com\/fr\/wp-json\/wp\/v2\/blog-language?post=71574"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}