	{"id":71574,"date":"2023-07-05T14:29:02","date_gmt":"2023-07-05T13:29:02","guid":{"rendered":"https:\/\/www.artefact.com\/?post_type=blog&#038;p=71574"},"modified":"2024-09-20T17:45:57","modified_gmt":"2024-09-20T16:45:57","slug":"encoding-categorical-features-in-forecasting-are-we-all-doing-it-wrong","status":"publish","type":"blog","link":"https:\/\/www.artefact.com\/br\/blog\/encoding-categorical-features-in-forecasting-are-we-all-doing-it-wrong\/","title":{"rendered":"Codifica\u00e7\u00e3o de caracter\u00edsticas categ\u00f3ricas na previs\u00e3o: estamos todos fazendo isso errado?"},"content":{"rendered":"<p><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-1 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling article-author\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-background-color:#ffffff;--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-0 fusion_builder_column_1_2 1_2 fusion-flex-column\" style=\"--awb-bg-size:cover;--awb-width-large:50%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:50%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\"><div class=\"fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column\"><div class=\"fusion-title title fusion-title-1 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Autor<\/h2><\/div><img decoding=\"async\" src=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%27150%27%20height%3D%270%27%20viewBox%3D%270%200%20150%200%27%3E%3Crect%20width%3D%27150%27%20height%3D%270%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Youssef-Oudghiri.jpeg\" alt=\"Image\" class=\"lazyload artefact-elegant-image align-left article-author-image\" style=\"width: 150px; border-radius: 54% 46% 77% 23% \/ 74% 40% 60% 26%; overflow: hidden;\" width=\"150\" height=\"auto\" \/><div class=\"fusion-title title fusion-title-2 fusion-sep-none fusion-title-text fusion-title-size-three article-author-name-title\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">O senhor Oudghiri<\/h3><\/div><div class=\"fusion-text fusion-text-1 article-author-description\" style=\"--awb-text-transform:none;\"><p>Data Cientista da Artefact Fran\u00e7a<\/p>\n<\/div><\/div><\/div><\/div><\/div><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-2 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-1 fusion_builder_column_1_1 1_1 fusion-flex-column\" style=\"--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\"><div class=\"fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column\"><div class=\"fusion-text fusion-text-2 description\"><p>Propomos um novo m\u00e9todo para codificar recursos categ\u00f3ricos especificamente adaptados para aplicativos de previs\u00e3o. Em ess\u00eancia, essa abordagem codifica recursos categ\u00f3ricos modelando a tend\u00eancia das quantidades associadas a cada categoria. Em nossos experimentos, essa abordagem mostra benef\u00edcios substanciais de desempenho, tanto em termos de precis\u00e3o de previs\u00e3o quanto de vi\u00e9s, pois permite que os modelos de conjunto baseados em \u00e1rvore modelem e extrapolem melhor as tend\u00eancias.<\/p>\n<\/div><\/div><\/div><\/div><\/div><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-3 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-margin-top:40px;--awb-margin-bottom:40px;--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-center fusion-flex-justify-content-center fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-2 fusion_builder_column_1_1 1_1 fusion-flex-column fusion-flex-align-self-center fusion-column-inner-bg-wrapper\" style=\"--awb-padding-top:20px;--awb-padding-right:20px;--awb-padding-bottom:20px;--awb-padding-left:20px;--awb-overflow:hidden;--awb-inner-bg-size:cover;--awb-border-color:rgba(10,17,40,0.1);--awb-border-top:1px;--awb-border-right:1px;--awb-border-bottom:1px;--awb-border-left:1px;--awb-border-style:solid;--awb-border-radius:4px 4px 4px 4px;--awb-inner-bg-border-radius:4px 4px 4px 4px;--awb-inner-bg-overflow:hidden;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\" data-scroll-devices=\"small-visibility,medium-visibility,large-visibility\"><span class=\"fusion-column-inner-bg hover-type-none\"><a class=\"fusion-column-anchor\" href=\"https:\/\/medium.com\/artefact-engineering-and-data-science\/encoding-categorical-features-in-forecasting-are-we-all-doing-it-wrong-fe8a9a6488da\" rel=\"noopener noreferrer\" target=\"_blank\"><span class=\"fusion-column-inner-bg-image\"><\/span><\/a><\/span><div class=\"fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-center fusion-content-layout-row fusion-flex-align-items-center\"><div class=\"fusion-text fusion-text-3\"><p><u>Leia nosso artigo sobre<\/u><\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-margin-right:20px;--awb-margin-left:20px;--awb-max-width:150px;--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-1 hover-type-none\"><img decoding=\"async\" width=\"4000\" height=\"992\" title=\"M\u00e9dio Blog\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog.png\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog.png\" alt class=\"lazyload img-responsive wp-image-60582\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%274000%27%20height%3D%27992%27%20viewBox%3D%270%200%204000%20992%27%3E%3Crect%20width%3D%274000%27%20height%3D%27992%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog-200x50.png 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog-400x99.png 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog-600x149.png 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog-800x198.png 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog-1200x298.png 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/04\/Medium-Blog.png 4000w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 4000px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-4\"><p>.<\/p>\n<\/div><\/div><\/div><\/div><\/div><article class=\"fusion-fullwidth fullwidth-box fusion-builder-row-4 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-justify-content-center fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-3 fusion_builder_column_1_1 1_1 fusion-flex-column\" style=\"--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\"><div class=\"fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column\"><div class=\"fusion-title title fusion-title-3 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Introdu\u00e7\u00e3o<\/h2><\/div><div class=\"fusion-text fusion-text-5\"><p>A motiva\u00e7\u00e3o para este trabalho surgiu de v\u00e1rios projetos de previs\u00e3o de clientes no Artefact, nos quais nossos modelos de refor\u00e7o apresentaram alta tend\u00eancia no momento da previs\u00e3o. Por meio de uma fase de diagn\u00f3stico, identificamos que uma das principais fontes de vi\u00e9s nos modelos de aprendizado de conjunto decorria de seus desafios em modelar com precis\u00e3o as tend\u00eancias e os n\u00edveis flutuantes.<\/p>\n<p>A seguir, demonstraremos\u00a0<strong>por que<\/strong>\u00a0e\u00a0<strong>como<\/strong>\u00a0usamos uma nova abordagem para codificar recursos categ\u00f3ricos. Com base em nossos experimentos envolvendo um projeto de previs\u00e3o de varejo de um cliente e v\u00e1rios datasets p\u00fablicos, provamos que essa t\u00e9cnica pode efetivamente atenuar o vi\u00e9s e aumentar a precis\u00e3o.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-4 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Boosting e tend\u00eancias, por que \u00e9 complexo?<\/h2><\/div><div class=\"fusion-title title fusion-title-5 fusion-sep-none fusion-title-text fusion-title-size-three\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Os algoritmos de refor\u00e7o t\u00eam dificuldade para extrapolar<\/h3><\/div><div class=\"fusion-text fusion-text-6\"><p>Os algoritmos de refor\u00e7o t\u00eam dificuldade para modelar e extrapolar tend\u00eancias, pois n\u00e3o podem prever novos valores n\u00e3o vistos no conjunto de treinamento\/ausentes nas folhas. \u201c<a href=\"https:\/\/pypi.org\/project\/linear-tree\/\" target=\"_blank\" rel=\"noopener ugc nofollow\">\u00c1rvore Linear<\/a>\u201dOs modelos \" tentam aliviar esse problema, mas nossos testes produziram resultados inconclusivos com esse m\u00e9todo.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-6 fusion-sep-none fusion-title-text fusion-title-size-three\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Codifica\u00e7\u00f5es cl\u00e1ssicas levam a previs\u00f5es est\u00e1ticas<\/h3><\/div><div class=\"fusion-text fusion-text-7\"><p>Os m\u00e9todos de codifica\u00e7\u00e3o mais comuns empregados no boosting promovem rela\u00e7\u00f5es est\u00e1ticas entre vari\u00e1veis independentes e dependentes, o que, por sua vez, contribui para aumentar o vi\u00e9s na presen\u00e7a de tend\u00eancias. O diagrama abaixo ilustra esse fen\u00f4meno:<\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-2 hover-type-none\"><img decoding=\"async\" width=\"1400\" height=\"477\" alt=\"Classical encodings push towards static predictions\" title=\"Codifica\u00e7\u00f5es cl\u00e1ssicas levam a previs\u00f5es est\u00e1ticas\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions.webp\" class=\"lazyload img-responsive wp-image-71578\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271400%27%20height%3D%27477%27%20viewBox%3D%270%200%201400%20477%27%3E%3Crect%20width%3D%271400%27%20height%3D%27477%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions-200x68.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions-400x136.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions-600x204.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions-800x273.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions-1200x409.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Classical-encodings-push-towards-static-predictions.webp 1400w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1400px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-8\"><p style=\"text-align: center;\"><em>Representa\u00e7\u00e3o visual simplificada que destaca a natureza est\u00e1tica da codifica\u00e7\u00e3o de caracter\u00edsticas categ\u00f3ricas empregada em algoritmos de refor\u00e7o<\/em><\/p>\n<\/div><div class=\"fusion-text fusion-text-9\"><p>Reconhecemos que a representa\u00e7\u00e3o acima \u00e9 uma simplifica\u00e7\u00e3o exagerada, pois as \u00e1rvores de decis\u00e3o s\u00e3o mais complexas e capazes de identificar rela\u00e7\u00f5es n\u00e3o lineares com base em v\u00e1rios fatores. De fato, a condi\u00e7\u00e3o \u201ca cor \u00e9 preta\u201d poderia estar associada a \u201co m\u00eas \u00e9 junho\u201d. Nesse caso, o fato de a cor ser preta n\u00e3o teria o mesmo impacto em todos os momentos. Mas vamos analisar o quadro geral:<\/p>\n<\/div><ul style=\"--awb-line-height:27.2px;--awb-icon-width:27.2px;--awb-icon-height:27.2px;--awb-icon-margin:11.2px;--awb-content-margin:38.4px;\" class=\"fusion-checklist fusion-checklist-1 fusion-checklist-default type-icons paddingList dark-text\"><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p>Atribuir um \u00fanico impacto para a cor preta em junho ainda n\u00e3o \u00e9 o ideal, pois o impacto em junho de 2021 pode ser diferente do impacto em junho de 2022. Mesmo se incluirmos o ano, primeiro o limite de decis\u00e3o se tornaria muito complexo para ser criado e identificado, mas tamb\u00e9m, o que aconteceria se o treinamento data terminasse em 2022 e as previs\u00f5es precisassem ser feitas para 2023?<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p><strong>A engenharia de recursos tem o objetivo de ajudar o modelo a identificar relacionamentos mais facilmente<\/strong>. Se pudermos ajudar o modelo a associar o impacto da cor preta em qualquer momento, sem exigir a identifica\u00e7\u00e3o de rela\u00e7\u00f5es complexas, isso seria altamente vantajoso para o modelo. Portanto, ...<\/p>\n<\/div><\/li><\/ul><div class=\"fusion-title title fusion-title-7 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Nossa nova abordagem: Codifica\u00e7\u00e3o din\u00e2mica de caracter\u00edsticas categ\u00f3ricas<\/h2><\/div><div class=\"fusion-title title fusion-title-8 fusion-sep-none fusion-title-text fusion-title-size-three\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Base da codifica\u00e7\u00e3o din\u00e2mica (v1 sem n\u00edvel de item)<\/h3><\/div><div class=\"fusion-text fusion-text-10\"><p id=\"9918\" data-selectable-paragraph=\"\">Em uma frase, nosso m\u00e9todo de codifica\u00e7\u00e3o de recursos categ\u00f3ricos poderia ser descrito como:\u00a0<strong>modelamos o componente de tend\u00eancia de cada categoria e usamos esses valores de tend\u00eancia para codificar esse recurso categ\u00f3rico<\/strong>.<\/p>\n<p id=\"b6c1\" data-selectable-paragraph=\"\">O diagrama abaixo ilustra a diferen\u00e7a entre uma codifica\u00e7\u00e3o m\u00e9dia est\u00e1tica e uma codifica\u00e7\u00e3o baseada em tend\u00eancias para duas categorias de cores: preto e dourado.<\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-3 hover-type-none\"><img decoding=\"async\" width=\"1400\" height=\"453\" title=\"Base da codifica\u00e7\u00e3o din\u00e2mica (v1 sem n\u00edvel de item)\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level.webp\" alt class=\"lazyload img-responsive wp-image-71579\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271400%27%20height%3D%27453%27%20viewBox%3D%270%200%201400%20453%27%3E%3Crect%20width%3D%271400%27%20height%3D%27453%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level-200x65.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level-400x129.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level-600x194.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level-800x259.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level-1200x388.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Basis-of-dynamic-encoding-v1-without-item-level.webp 1400w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1400px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-11\"><p style=\"text-align: center;\"><em>Ilustra\u00e7\u00e3o mostrando o princ\u00edpio da codifica\u00e7\u00e3o din\u00e2mica, que envolve a modelagem de tend\u00eancias para cada categoria<\/em><\/p>\n<\/div><div class=\"fusion-text fusion-text-12\"><p id=\"8d62\" data-selectable-paragraph=\"\">Em nossos experimentos, optamos por usar o Prophet para extrair o componente de tend\u00eancia. Naturalmente, tamb\u00e9m \u00e9 poss\u00edvel considerar outros modelos de previs\u00e3o de s\u00e9ries temporais.<\/p>\n<p id=\"5677\" data-selectable-paragraph=\"\">Observe que a codifica\u00e7\u00e3o m\u00e9dia est\u00e1tica implica que as vendas de itens pretos est\u00e3o em um n\u00edvel m\u00e9dio de 100 unidades\/m\u00eas em qualquer momento. A codifica\u00e7\u00e3o din\u00e2mica, por outro lado, permite levar em conta a tend\u00eancia de aumento observada nos itens pretos e \u00e9 capaz de extrapol\u00e1-la no futuro. Uma afirma\u00e7\u00e3o semelhante pode ser feita com rela\u00e7\u00e3o aos itens de ouro. Portanto, nossa abordagem ser\u00e1 especialmente \u00fatil em conjuntos data em que a vari\u00e1vel-alvo a ser prevista segue tend\u00eancias acentuadas nas diversas categorias dispon\u00edveis.<\/p>\n<p id=\"096b\" data-selectable-paragraph=\"\">Nosso foco principal \u00e9 permitir que o modelo se adapte mais facilmente \u00e0s rela\u00e7\u00f5es mut\u00e1veis entre as vari\u00e1veis independentes e a vari\u00e1vel dependente a ser prevista. Portanto, esse m\u00e9todo de codifica\u00e7\u00e3o din\u00e2mica tamb\u00e9m poderia ser aplicado a recursos num\u00e9ricos. Considere o exemplo do pre\u00e7o. Embora o pre\u00e7o seja num\u00e9rico e o modelo possa criar regras diretamente com base nele, as prefer\u00eancias das pessoas por itens baratos ou caros ainda podem evoluir com o tempo e seguir uma tend\u00eancia de vendas espec\u00edfica. No contexto de uma crise econ\u00f4mica, por exemplo, os produtos acess\u00edveis podem seguir uma tend\u00eancia de aumento de vendas, enquanto os caros podem seguir uma tend\u00eancia de redu\u00e7\u00e3o. Considerando \u2018acess\u00edvel\u2019 como uma categoria e \u2018caro\u2019 como outra, poder\u00edamos propor uma codifica\u00e7\u00e3o din\u00e2mica para o recurso de pre\u00e7o, assim como fizemos com as cores.<\/p>\n<p id=\"671a\" data-selectable-paragraph=\"\">\u00c9 importante observar que, para recursos num\u00e9ricos, tanto as vari\u00e1veis de base quanto as codificadas dinamicamente podem ser usadas no modelo, pois elas fornecer\u00e3o diferentes tipos de informa\u00e7\u00f5es.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-9 fusion-sep-none fusion-title-text fusion-title-size-three\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Dar mais import\u00e2ncia aos recursos din\u00e2micos (v2 com n\u00edvel de item)<\/h3><\/div><div class=\"fusion-text fusion-text-13\"><p id=\"c481\" data-selectable-paragraph=\"\">Embora esse novo m\u00e9todo de codifica\u00e7\u00e3o seja uma melhoria, muitas vezes a import\u00e2ncia dos recursos categ\u00f3ricos n\u00e3o \u00e9 alta o suficiente para afetar significativamente as previs\u00f5es ao examinar a import\u00e2ncia dos recursos. Para dar mais import\u00e2ncia aos recursos din\u00e2micos e, assim, promover uma melhor modelagem e extrapola\u00e7\u00e3o de tend\u00eancias, adaptamos os valores de codifica\u00e7\u00e3o a cada s\u00e9rie temporal\/item individualmente.<\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-4 hover-type-none\"><img decoding=\"async\" width=\"1342\" height=\"296\" title=\"Dar mais import\u00e2ncia aos recursos din\u00e2micos (v2 com n\u00edvel de item)\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level.webp\" alt class=\"lazyload img-responsive wp-image-71580\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271342%27%20height%3D%27296%27%20viewBox%3D%270%200%201342%20296%27%3E%3Crect%20width%3D%271342%27%20height%3D%27296%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level-200x44.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level-400x88.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level-600x132.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level-800x176.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level-1200x265.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Giving-more-importance-to-dynamic-features-v2-with-item-level.webp 1342w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1342px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-14\"><p style=\"text-align: center;\"><em>F\u00f3rmula que representa os dois componentes da codifica\u00e7\u00e3o din\u00e2mica: n\u00edvel de categoria e n\u00edvel de item<\/em><\/p>\n<\/div><div class=\"fusion-text fusion-text-15\"><p>Voltando ao nosso exemplo de cor, com dois itens pretos diferentes, isso permite que a codifica\u00e7\u00e3o din\u00e2mica da categoria \u201cpreto\u201d para cada item seja diferente com base em suas vendas individuais anteriores.<\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-5 hover-type-none\"><img decoding=\"async\" width=\"1400\" height=\"587\" alt=\"Table illustrating the calculation of dynamic encoding through a simple example\" title=\"Tabela que ilustra o c\u00e1lculo da codifica\u00e7\u00e3o din\u00e2mica por meio de um exemplo simples\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example.webp\" class=\"lazyload img-responsive wp-image-71582\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271400%27%20height%3D%27587%27%20viewBox%3D%270%200%201400%20587%27%3E%3Crect%20width%3D%271400%27%20height%3D%27587%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example-200x84.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example-400x168.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example-600x252.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example-800x335.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example-1200x503.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/Table-illustrating-the-calculation-of-dynamic-encoding-through-a-simple-example.webp 1400w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1400px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-16\"><p style=\"text-align: center;\"><em>Tabela que ilustra o c\u00e1lculo da codifica\u00e7\u00e3o din\u00e2mica por meio de um exemplo simples<\/em><\/p>\n<\/div><div class=\"fusion-title title fusion-title-10 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Experimentos e resultados<\/h2><\/div><div class=\"fusion-title title fusion-title-11 fusion-sep-none fusion-title-text fusion-title-size-three\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Cliente dataset<\/h3><\/div><div class=\"fusion-text fusion-text-17\"><p>Usamos nossa abordagem para prever as vendas de um de nossos clientes do setor de varejo. Validamos completamente nosso m\u00e9todo em uma ampla gama de escopos para garantir sua efic\u00e1cia. Aqui est\u00e3o alguns pontos data referentes ao contexto experimental:<\/p>\n<\/div><ul style=\"--awb-line-height:27.2px;--awb-icon-width:27.2px;--awb-icon-height:27.2px;--awb-icon-margin:11.2px;--awb-content-margin:38.4px;\" class=\"fusion-checklist fusion-checklist-2 fusion-checklist-default type-icons paddingList dark-text\"><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">Os experimentos foram realizados em 9 escopos de produtos diferentes, com um modelo de refor\u00e7o (LightGBM) para cada escopo.<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p>Para cada escopo, foi realizada uma valida\u00e7\u00e3o cruzada k-fold com uma janela de expans\u00e3o (k=5).<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p>Horizonte de previs\u00e3o: Dia+1 a Dia+180.<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p>O desempenho foi avaliado por meio de duas m\u00e9tricas:<\/p>\n<\/div><\/li><\/ul><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-6 hover-type-none\"><img decoding=\"async\" width=\"1400\" height=\"329\" title=\"f\u00f3rmula\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule.webp\" alt class=\"lazyload img-responsive wp-image-71583\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271400%27%20height%3D%27329%27%20viewBox%3D%270%200%201400%20329%27%3E%3Crect%20width%3D%271400%27%20height%3D%27329%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule-200x47.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule-400x94.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule-600x141.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule-800x188.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule-1200x282.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/formule.webp 1400w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1400px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-18\"><p id=\"3f94\" data-selectable-paragraph=\"\">Em geral, o m\u00e9todo provou ser altamente eficiente,\u00a0<strong>resultando em uma redu\u00e7\u00e3o absoluta m\u00e9dia na tend\u00eancia de 9,82% e um aumento absoluto m\u00e9dio na precis\u00e3o da previs\u00e3o de 6,29%<\/strong>\u00a0nos 9 escopos de produtos e 5 dobras de valida\u00e7\u00e3o cruzada.<\/p>\n<p id=\"22b9\" data-selectable-paragraph=\"\">A pr\u00f3xima se\u00e7\u00e3o valida a relev\u00e2ncia do nosso m\u00e9todo testando-o em um dataset p\u00fablico.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-12 fusion-sep-none fusion-title-text fusion-title-size-three\" style=\"--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">Vendas em lojas p\u00fablicas dataset<\/h3><\/div><div class=\"fusion-text fusion-text-19\"><p id=\"fbf3\" data-selectable-paragraph=\"\">Neste estudo de caso simplificado, usamos o\u00a0<a href=\"https:\/\/www.kaggle.com\/competitions\/store-sales-time-series-forecasting\/data\" target=\"_blank\" rel=\"noopener ugc nofollow\">Vendas na loja - Previs\u00e3o de s\u00e9ries temporais<\/a>\u00a0Kaggle dataset. Esse dataset apresenta uma tend\u00eancia acentuada ao examinar a s\u00e9rie temporal m\u00e9dia de vendas, o que torna nosso m\u00e9todo particularmente relevante. Al\u00e9m disso, o horizonte de previs\u00e3o escolhido \u00e9 de tr\u00eas meses, o que \u00e9 distante o suficiente para se beneficiar dos recursos extras de extrapola\u00e7\u00e3o da codifica\u00e7\u00e3o din\u00e2mica. Para fins de demonstra\u00e7\u00e3o, limitamos o dataset a 31 de mar\u00e7o de 2016, logo antes da ocorr\u00eancia de um terremoto, o que fez com que a curva de vendas se achatasse.<\/p>\n<p id=\"2ec5\" data-selectable-paragraph=\"\">Antes de qualquer codifica\u00e7\u00e3o, nosso conjunto inicial de data compreende aproximadamente 75% de caracter\u00edsticas num\u00e9ricas, incluindo defasagens, m\u00e9dias m\u00f3veis, caracter\u00edsticas de calend\u00e1rio e eventos de feriados. Os 25% restantes consistem em atributos categ\u00f3ricos, como fam\u00edlia de produtos, n\u00famero da loja, cidade e outros.<\/p>\n<p id=\"f0f6\" data-selectable-paragraph=\"\">Dois modelos distintos s\u00e3o treinados: um emprega os recursos categ\u00f3ricos que foram codificados dinamicamente usando nosso m\u00e9todo personalizado, enquanto o outro usa o tratamento nativo de recursos categ\u00f3ricos do LightGBM.<\/p>\n<p id=\"f3b9\" data-selectable-paragraph=\"\">Ao compararmos seu desempenho, observamos um aprimoramento significativo na abordagem de codifica\u00e7\u00e3o din\u00e2mica. A tabela a seguir apresenta um resumo dos resultados:<\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-7 hover-type-none\"><img decoding=\"async\" width=\"1222\" height=\"272\" alt=\"Comparison of RMSE, FA, and %Bias between LightGBM encoding method and dynamic encoding\" title=\"imagem 1\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1.webp\" class=\"lazyload img-responsive wp-image-71584\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271222%27%20height%3D%27272%27%20viewBox%3D%270%200%201222%20272%27%3E%3Crect%20width%3D%271222%27%20height%3D%27272%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1-200x45.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1-400x89.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1-600x134.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1-800x178.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1-1200x267.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image-1.webp 1222w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1222px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-20\"><p style=\"text-align: center;\"><em>Compara\u00e7\u00e3o de RMSE, FA e %Bias entre o m\u00e9todo de codifica\u00e7\u00e3o LightGBM e a codifica\u00e7\u00e3o din\u00e2mica<\/em><\/p>\n<\/div><div class=\"fusion-image-element\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-8 hover-type-none\"><img decoding=\"async\" width=\"1304\" height=\"364\" title=\"imagem2\" src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2.webp\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2.webp\" alt class=\"lazyload img-responsive wp-image-71585\" srcset=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271304%27%20height%3D%27364%27%20viewBox%3D%270%200%201304%20364%27%3E%3Crect%20width%3D%271304%27%20height%3D%27364%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-srcset=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2-200x56.webp 200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2-400x112.webp 400w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2-600x167.webp 600w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2-800x223.webp 800w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2-1200x335.webp 1200w, https:\/\/www.artefact.com\/\/wp-content\/uploads\/2023\/07\/image2.webp 1304w\" data-sizes=\"auto\" data-orig-sizes=\"(max-width: 640px) 100vw, 1304px\" \/><\/span><\/div><div class=\"fusion-text fusion-text-21\"><p style=\"text-align: center;\"><em>M\u00e9dia de vendas semanais + previs\u00f5es de 3 meses (codifica\u00e7\u00e3o din\u00e2mica vs. m\u00e9todo de codifica\u00e7\u00e3o LightGBM)<br \/>\n<\/em><\/p>\n<\/div><div class=\"fusion-text fusion-text-22\"><p>Conforme ilustrado no gr\u00e1fico acima, o modelo que incorpora codifica\u00e7\u00f5es din\u00e2micas\u00a0<strong>captura efetivamente a tend\u00eancia e a extrapola<\/strong>, enquanto o modelo alternativo tem dificuldade para fazer isso.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-13 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Uso e limites<\/h2><\/div><div class=\"fusion-text fusion-text-23\"><p>Nosso m\u00e9todo se mostra especialmente valioso em cen\u00e1rios em que a s\u00e9rie temporal exibe\u00a0<strong>tend\u00eancias acentuadas<\/strong>\u00a0e o\u00a0<strong>horizonte de previs\u00e3o<\/strong> \u00e9 distante o suficiente para se beneficiar da extrapola\u00e7\u00e3o de tend\u00eancias. Al\u00e9m disso, \u00e0 medida que codificamos e incorporamos dinamicamente\u00a0<strong>caracter\u00edsticas mais categ\u00f3ricas<\/strong>\u00a0com\u00a0<strong>preditivo significativo<\/strong> <strong>pot\u00eancia<\/strong>\u00a0no modelo,\u00a0<strong>o efeito obtido por meio de nossa abordagem sobre as previs\u00f5es aumenta<\/strong>. No entanto, \u00e9 importante reconhecer que outros m\u00e9todos de codifica\u00e7\u00e3o t\u00eam suas pr\u00f3prias vantagens e podem ser mais vantajosos em diferentes contextos. Al\u00e9m disso, existe a possibilidade de combinar os dois tipos de codifica\u00e7\u00e3o para obter resultados potencialmente melhores.<\/p>\n<\/div><div class=\"fusion-title title fusion-title-14 fusion-sep-none fusion-title-text fusion-title-size-two\" style=\"--awb-margin-bottom-small:8px;\"><h2 class=\"fusion-title-heading title-heading-left fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:50;line-height:1.2;\">Conclus\u00e3o<\/h2><\/div><ul style=\"--awb-line-height:27.2px;--awb-icon-width:27.2px;--awb-icon-height:27.2px;--awb-icon-margin:11.2px;--awb-content-margin:38.4px;\" class=\"fusion-checklist fusion-checklist-3 fusion-checklist-default type-icons paddingList dark-text\"><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">As t\u00e9cnicas de codifica\u00e7\u00e3o convencionais para recursos categ\u00f3ricos n\u00e3o s\u00e3o ideais para previs\u00f5es, principalmente quando as s\u00e9ries temporais apresentam tend\u00eancias acentuadas e o horizonte de previs\u00e3o \u00e9 distante.<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p>Nosso m\u00e9todo \u00e9 uma varia\u00e7\u00e3o do empilhamento de modelos, pois empregamos um modelo Prophet - que apresenta recursos superiores para modelagem e extrapola\u00e7\u00e3o de tend\u00eancias - para construir a codifica\u00e7\u00e3o dos recursos categ\u00f3ricos.<\/p>\n<\/div><\/li><li class=\"fusion-li-item\" style=\"\"><span class=\"icon-wrapper circle-no\"><i class=\"fusion-li-icon awb-icon-check\" aria-hidden=\"true\"><\/i><\/span><div class=\"fusion-li-item-content\">\n<p>Nossos experimentos demonstraram as vantagens de reduzir o vi\u00e9s e aumentar a precis\u00e3o da previs\u00e3o.<\/p>\n<\/div><\/li><\/ul><div class=\"fusion-text fusion-text-24\"><p>Temos planos de publicar um documento nos pr\u00f3ximos meses, que incluir\u00e1 detalhes completos de nossa abordagem e implementa\u00e7\u00e3o. <a href=\"https:\/\/www.artefact.com\/br\/blog\/\">Fique atento<\/a> para mais atualiza\u00e7\u00f5es!<\/p>\n<\/div><\/div><\/div><\/div><\/article><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-5 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-margin-top:40px;--awb-margin-bottom:40px;--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-center fusion-flex-justify-content-center fusion-flex-content-wrap\" style=\"max-width:calc( 1440px + 20px );margin-left: calc(-20px \/ 2 );margin-right: calc(-20px \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-4 fusion_builder_column_1_1 1_1 fusion-flex-column fusion-flex-align-self-center\" style=\"--awb-padding-top:40px;--awb-padding-right:40px;--awb-padding-bottom:40px;--awb-padding-left:40px;--awb-overflow:hidden;--awb-bg-position:left center;--awb-bg-size:cover;--awb-border-color:rgba(10,17,40,0.1);--awb-border-style:solid;--awb-border-radius:4px 4px 4px 4px;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:10px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:10px;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:10px;--awb-spacing-left-medium:10px;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:10px;--awb-spacing-left-small:10px;\"><div class=\"fusion-column-wrapper lazyload fusion-column-has-shadow fusion-flex-justify-content-center fusion-content-layout-column fusion-column-has-bg-image\" data-bg-url=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/03\/background.jpg\" data-bg=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/03\/background.jpg\"><div class=\"fusion-image-element\" style=\"text-align:center;--awb-margin-right:20px;--awb-margin-left:20px;--awb-max-width:150px;--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\"fusion-imageframe imageframe-none imageframe-9 hover-type-none\"><img decoding=\"async\" width=\"72\" height=\"41\" title=\"m\u00e9dio\" src=\"data:image\/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%2772%27%20height%3D%2741%27%20viewBox%3D%270%200%2072%2041%27%3E%3Crect%20width%3D%2772%27%20height%3D%2741%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E\" data-orig-src=\"https:\/\/www.artefact.com\/\/wp-content\/uploads\/2021\/03\/medium.png\" alt class=\"lazyload img-responsive wp-image-60927\"\/><\/span><\/div><div class=\"fusion-title title fusion-title-15 fusion-sep-none fusion-title-center fusion-title-text fusion-title-size-three\" style=\"--awb-margin-top:20px;--awb-margin-bottom:0px;--awb-margin-bottom-small:8px;\"><h3 class=\"fusion-title-heading title-heading-center fusion-responsive-typography-calculated\" style=\"margin:0;--fontSize:20;line-height:1.2;\">M\u00e9dia Blog por Artefact.<\/h3><\/div><div class=\"fusion-text fusion-text-25\" style=\"--awb-content-alignment:center;\"><p>Este artigo foi publicado inicialmente no <strong>Medium.com<\/strong>.<br \/>\nSiga-nos em nosso Medium Blog !<\/p>\n<\/div><div style=\"text-align:center;\"><a class=\"fusion-button button-flat button-medium button-default fusion-button-default button-1 fusion-button-default-span fusion-button-default-type\" target=\"_blank\" rel=\"noopener noreferrer\" href=\"https:\/\/medium.com\/artefact-engineering-and-data-science\/encoding-categorical-features-in-forecasting-are-we-all-doing-it-wrong-fe8a9a6488da\"><span class=\"fusion-button-text awb-button__text awb-button__text--default\">Leia nosso artigo<\/span><\/a><\/div><\/div><\/div><\/div><\/div><\/p>","protected":false},"excerpt":{"rendered":"<p>Propomos um novo m\u00e9todo para codificar recursos categ\u00f3ricos especificamente adaptados para aplicativos de previs\u00e3o.<\/p>","protected":false},"featured_media":71575,"parent":0,"template":"","meta":{"_acf_changed":false,"ep_exclude_from_search":false},"blog-category":[21939],"blog-language":[2991],"class_list":["post-71574","blog","type-blog","status-publish","has-post-thumbnail","hentry","blog-category-medium","blog-language-en"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.artefact.com\/br\/wp-json\/wp\/v2\/blog\/71574","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.artefact.com\/br\/wp-json\/wp\/v2\/blog"}],"about":[{"href":"https:\/\/www.artefact.com\/br\/wp-json\/wp\/v2\/types\/blog"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.artefact.com\/br\/wp-json\/wp\/v2\/media\/71575"}],"wp:attachment":[{"href":"https:\/\/www.artefact.com\/br\/wp-json\/wp\/v2\/media?parent=71574"}],"wp:term":[{"taxonomy":"blog-category","embeddable":true,"href":"https:\/\/www.artefact.com\/br\/wp-json\/wp\/v2\/blog-category?post=71574"},{"taxonomy":"blog-language","embeddable":true,"href":"https:\/\/www.artefact.com\/br\/wp-json\/wp\/v2\/blog-language?post=71574"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}