{"id":50,"date":"2023-11-26T21:50:26","date_gmt":"2023-11-27T00:50:26","guid":{"rendered":"http:\/\/localhost\/sad\/?page_id=6"},"modified":"2025-11-18T21:53:16","modified_gmt":"2025-11-18T21:53:16","slug":"mineracao-de-texto","status":"publish","type":"page","link":"https:\/\/wordpress.ft.unicamp.br\/revisa\/mineracao-de-texto\/","title":{"rendered":"Minera\u00e7\u00e3o de Texto"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-page\" data-elementor-id=\"50\" class=\"elementor elementor-50\">\n\t\t\t\t<div class=\"elementor-element elementor-element-2004c65d e-flex e-con-boxed e-con e-parent\" data-id=\"2004c65d\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-4851a3a elementor-widget elementor-widget-heading\" data-id=\"4851a3a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">1. O que \u00e9 Minera\u00e7\u00e3o de Texto?<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c7f6716 elementor-widget elementor-widget-text-editor\" data-id=\"c7f6716\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>A minera\u00e7\u00e3o de texto, como disciplina inovadora e vital no campo da ci\u00eancia de dados, est\u00e1 intrinsecamente ligada \u00e0 extra\u00e7\u00e3o de conhecimento a partir de dados textuais n\u00e3o estruturados. Para compreender profundamente essa pr\u00e1tica, \u00e9 essencial desmembrar suas principais caracter\u00edsticas e objetivos.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-049b299 elementor-widget elementor-widget-heading\" data-id=\"049b299\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">1.1. Defini\u00e7\u00e3o e Escopo:<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-a5a60c3 elementor-widget elementor-widget-text-editor\" data-id=\"a5a60c3\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>A minera\u00e7\u00e3o de texto, tamb\u00e9m conhecida como an\u00e1lise de texto, refere-se \u00e0 aplica\u00e7\u00e3o de m\u00e9todos e t\u00e9cnicas computacionais para extrair padr\u00f5es, informa\u00e7\u00f5es e conhecimentos relevantes de grandes conjuntos de dados textuais. Diferentemente de dados estruturados, como tabelas em bancos de dados, os dados textuais s\u00e3o caracterizados por sua natureza n\u00e3o linear e contextual, exigindo abordagens espec\u00edficas para revelar sua riqueza informativa.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-50908d9 elementor-widget elementor-widget-heading\" data-id=\"50908d9\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\">1.2. Objetivos Principais:<\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-59d2926 elementor-widget elementor-widget-text-editor\" data-id=\"59d2926\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h1><span style=\"font-family: Roboto, sans-serif;font-weight: 400;font-size: 16px\">Identifica\u00e7\u00e3o de Padr\u00f5es Lingu\u00edsticos: A minera\u00e7\u00e3o de texto visa descobrir padr\u00f5es lingu\u00edsticos e estruturas sem\u00e2nticas nos documentos, permitindo uma compreens\u00e3o mais profunda do conte\u00fado textual.<\/span><\/h1>\n<p>Extra\u00e7\u00e3o de Informa\u00e7\u00f5es Chave: Uma meta crucial \u00e9 extrair informa\u00e7\u00f5es espec\u00edficas e relevantes de documentos, como datas, eventos, locais e entidades, proporcionando uma s\u00edntese eficaz do conte\u00fado.<\/p>\n<p><!-- \/wp:heading --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>An\u00e1lise de Sentimento e Opini\u00e3o: Por meio da an\u00e1lise de sentimento, a minera\u00e7\u00e3o de texto permite avaliar o tom emocional de documentos, possibilitando a compreens\u00e3o da opini\u00e3o p\u00fablica sobre determinados t\u00f3picos.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f762c3d elementor-widget elementor-widget-heading\" data-id=\"f762c3d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\">1.3. Import\u00e2ncia na Era da Informa\u00e7\u00e3o:<\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9f84da2 elementor-widget elementor-widget-text-editor\" data-id=\"9f84da2\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Na era da informa\u00e7\u00e3o, onde grandes volumes de dados s\u00e3o gerados a cada segundo, a minera\u00e7\u00e3o de texto torna-se uma ferramenta indispens\u00e1vel. Seja para entender o comportamento do consumidor, analisar tend\u00eancias de mercado ou automatizar processos de tomada de decis\u00e3o, essa disciplina desempenha um papel crucial na transforma\u00e7\u00e3o de dados textuais aparentemente ca\u00f3ticos em insights acion\u00e1veis.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:heading {\"level\":1} --><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-ca6cd13 elementor-widget elementor-widget-heading\" data-id=\"ca6cd13\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\">1.4. Rela\u00e7\u00e3o com Processamento de Linguagem Natural (PLN):<\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-885b233 elementor-widget elementor-widget-text-editor\" data-id=\"885b233\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>A minera\u00e7\u00e3o de texto est\u00e1 intimamente associada ao Processamento de Linguagem Natural (PLN), uma sub\u00e1rea da intelig\u00eancia artificial dedicada \u00e0 intera\u00e7\u00e3o entre computadores e linguagem humana. Ao incorporar t\u00e9cnicas PLN, a minera\u00e7\u00e3o de texto consegue lidar com a complexidade da linguagem natural, incluindo ambiguidades sem\u00e2nticas, varia\u00e7\u00f5es sint\u00e1ticas e nuances contextuais.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-30b6d70 elementor-widget elementor-widget-heading\" data-id=\"30b6d70\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">2. Dificuldades na Minera\u00e7\u00e3o de Texto:<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-56ae6f8 elementor-widget elementor-widget-text-editor\" data-id=\"56ae6f8\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Navegando pelos Desafios Inerentes. A minera\u00e7\u00e3o de texto, apesar de ser uma ferramenta poderosa na extra\u00e7\u00e3o de conhecimento de dados textuais, enfrenta uma s\u00e9rie de desafios intrincados. Compreender e abordar essas dificuldades \u00e9 essencial para desenvolver abordagens eficazes e promover avan\u00e7os significativos nesta disciplina.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-e01f279 elementor-widget elementor-widget-heading\" data-id=\"e01f279\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\">2.1. Ambiguidade Lingu\u00edstica:<\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0d5f628 elementor-widget elementor-widget-text-editor\" data-id=\"0d5f628\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Um dos desafios primordiais na minera\u00e7\u00e3o de texto \u00e9 a ambiguidade lingu\u00edstica. Palavras frequentemente t\u00eam significados diferentes em contextos distintos, e a interpreta\u00e7\u00e3o correta depende fortemente do contexto em que s\u00e3o utilizadas. Lidar com essa ambiguidade exige a implementa\u00e7\u00e3o de algoritmos e modelos capazes de discernir o contexto apropriado para uma interpreta\u00e7\u00e3o precisa.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-8fb80a4 elementor-widget elementor-widget-heading\" data-id=\"8fb80a4\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\">2.2. Sarcasmo e Ironia:\n\n\n<\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-3ec41ae elementor-widget elementor-widget-text-editor\" data-id=\"3ec41ae\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p><span style=\"letter-spacing: 0px\">A complexidade da comunica\u00e7\u00e3o humana \u00e9 exacerbada pela presen\u00e7a de sarcasmo e ironia, que s\u00e3o formas de express\u00e3o frequentemente desafiadoras para algoritmos de minera\u00e7\u00e3o de texto. A detec\u00e7\u00e3o dessas nuances lingu\u00edsticas requer modelos avan\u00e7ados capazes de compreender n\u00e3o apenas as palavras, mas tamb\u00e9m os tons e inten\u00e7\u00f5es subjacentes.<\/span><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-12d92d4 elementor-widget elementor-widget-heading\" data-id=\"12d92d4\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\">2.3. Volume e Variedade de Dados:<\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-71734f8 elementor-widget elementor-widget-text-editor\" data-id=\"71734f8\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p style=\"margin-bottom: 1.5em;margin-block-end: 0px;color: #7a7a7a;font-family: Roboto, sans-serif;font-size: 16px;font-style: normal;font-weight: 400;letter-spacing: normal;text-align: start;text-indent: 0px;text-transform: none;background-color: #ffffff\">\u00a0O volume exponencial de dados textuais dispon\u00edveis representa um desafio significativo. A minera\u00e7\u00e3o de grandes conjuntos de dados requer efici\u00eancia computacional e estrat\u00e9gias robustas para lidar com a diversidade textual. A variedade de fontes, estilos e formatos torna a tarefa ainda mais complexa, exigindo m\u00e9todos adapt\u00e1veis e escal\u00e1veis.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-52f4310 elementor-widget elementor-widget-heading\" data-id=\"52f4310\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\"><span style=\"letter-spacing: 0px\">2.4. Entendimento de Contexto:<\/span><\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-3978ebb elementor-widget elementor-widget-text-editor\" data-id=\"3978ebb\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>A compreens\u00e3o precisa do contexto no qual as palavras e frases s\u00e3o utilizadas \u00e9 vital para evitar interpreta\u00e7\u00f5es equivocadas. A aus\u00eancia de contextos claros pode levar a erros na an\u00e1lise, especialmente quando palavras t\u00eam m\u00faltiplos significados ou quando frases s\u00e3o amb\u00edguas.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-96bae00 elementor-widget elementor-widget-heading\" data-id=\"96bae00\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">3. Modelos na Minera\u00e7\u00e3o de Texto: Navegando pelos Caminhos da Representa\u00e7\u00e3o Textual<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2f04107 elementor-widget elementor-widget-text-editor\" data-id=\"2f04107\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Os modelos na minera\u00e7\u00e3o de texto desempenham um papel fundamental na capacidade de extrair informa\u00e7\u00f5es significativas de dados n\u00e3o estruturados. Neste cap\u00edtulo, exploraremos as principais abordagens e t\u00e9cnicas utilizadas para representar e compreender textos, desde modelos cl\u00e1ssicos at\u00e9 avan\u00e7adas arquiteturas de aprendizado profundo.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c19e73c elementor-widget elementor-widget-heading\" data-id=\"c19e73c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">3.1 Modelo Bag of Words (BoW):<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5629111 elementor-widget elementor-widget-text-editor\" data-id=\"5629111\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>O BoW, um dos modelos mais fundamentais, representa documentos como conjuntos n\u00e3o ordenados de palavras, ignorando a estrutura gramatical. Cada palavra \u00e9 tratada como uma &#8220;entidade&#8221; independente, e a frequ\u00eancia de ocorr\u00eancia \u00e9 usada para criar um vetor de caracter\u00edsticas. Embora simples, o BoW \u00e9 eficaz em tarefas b\u00e1sicas, como classifica\u00e7\u00e3o de documentos.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-774a53a elementor-widget elementor-widget-heading\" data-id=\"774a53a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">3.2 Word Embeddings:<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-6a04e9e elementor-widget elementor-widget-text-editor\" data-id=\"6a04e9e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Modelos de Word Embeddings, como Word2Vec e GloVe, introduzem uma abordagem mais sofisticada. Eles representam palavras como vetores densos em um espa\u00e7o sem\u00e2ntico, capturando rela\u00e7\u00f5es sem\u00e2nticas entre palavras. Essa t\u00e9cnica permite a captura de significados contextuais e melhora a capacidade de representa\u00e7\u00e3o sem\u00e2ntica.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-bca94ae elementor-widget elementor-widget-heading\" data-id=\"bca94ae\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">3.3 Modelos de Aprendizado Profundo:<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-1c45cd4 elementor-widget elementor-widget-text-editor\" data-id=\"1c45cd4\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Redes Neurais Recorrentes (RNNs): Essas redes s\u00e3o projetadas para lidar com dados sequenciais, sendo aplicadas a problemas em que a ordem das palavras \u00e9 crucial. No entanto, RNNs enfrentam desafios com depend\u00eancias temporais de longo prazo.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>Long Short-Term Memory (LSTM): Uma extens\u00e3o das RNNs, o LSTM supera o problema das depend\u00eancias temporais de longo prazo, sendo eficaz em tarefas como an\u00e1lise de sentimentos e tradu\u00e7\u00e3o autom\u00e1tica.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>Transformers: Arquiteturas como o Transformer revolucionaram a minera\u00e7\u00e3o de texto. Ao introduzir mecanismos de aten\u00e7\u00e3o, os Transformers capturam rela\u00e7\u00f5es contextuais de longo alcance, permitindo um entendimento mais profundo da estrutura textual. O BERT (Bidirectional Encoder Representations from Transformers) \u00e9 um exemplo not\u00e1vel.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-b7d187e elementor-widget elementor-widget-heading\" data-id=\"b7d187e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">3.4 Modelos de Classifica\u00e7\u00e3o:<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-b029372 elementor-widget elementor-widget-text-editor\" data-id=\"b029372\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>M\u00e1quinas de Vetores de Suporte (SVM): Embora n\u00e3o seja especificamente um modelo de minera\u00e7\u00e3o de texto, SVM \u00e9 frequentemente aplicado para tarefas de classifica\u00e7\u00e3o, como categoriza\u00e7\u00e3o de documentos.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>Naive Bayes: Um modelo probabil\u00edstico simples, mas eficaz, utilizado em tarefas como classifica\u00e7\u00e3o de documentos e filtragem de spam.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2db1a9d elementor-widget elementor-widget-heading\" data-id=\"2db1a9d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">3.5 Modelos de Sumariza\u00e7\u00e3o e Gera\u00e7\u00e3o de Texto:<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5190fc7 elementor-widget elementor-widget-text-editor\" data-id=\"5190fc7\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>Seq2Seq (Sequence-to-Sequence): Usado para tarefas de tradu\u00e7\u00e3o autom\u00e1tica e resumo de texto.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>GPT (Generative Pre-trained Transformer): Modelos como o GPT-3 t\u00eam alcan\u00e7ado resultados not\u00e1veis na gera\u00e7\u00e3o de texto coerente e contextualmente relevante.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f2aa29d elementor-widget elementor-widget-heading\" data-id=\"f2aa29d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><h2 class=\"wp-block-heading\"><strong>Cap\u00edtulo 4: Como Fazer Minera\u00e7\u00e3o de Texto<\/strong><\/h2><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f611d47 elementor-widget elementor-widget-text-editor\" data-id=\"f611d47\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>A minera\u00e7\u00e3o de texto, embora desafiadora, pode ser acess\u00edvel com a aplica\u00e7\u00e3o de m\u00e9todos e t\u00e9cnicas adequadas. Este cap\u00edtulo fornece um tutorial detalhado, abrangendo desde o pr\u00e9-processamento de dados at\u00e9 a aplica\u00e7\u00e3o de algoritmos de minera\u00e7\u00e3o de texto. Siga os passos cuidadosamente para extrair insights valiosos de seus dados textuais.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-cb41d6e elementor-widget elementor-widget-heading\" data-id=\"cb41d6e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">4.1 Pr\u00e9-processamento de Dados:<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-bb31162 elementor-widget elementor-widget-text-editor\" data-id=\"bb31162\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Antes de come\u00e7ar a minera\u00e7\u00e3o, \u00e9 crucial preparar os dados. O pr\u00e9-processamento inclui:<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-303d733 elementor-widget elementor-widget-heading\" data-id=\"303d733\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\">4.1.1 Tokeniza\u00e7\u00e3o:<\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-82639ac elementor-widget elementor-widget-text-editor\" data-id=\"82639ac\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p class=\"wp-block-heading\">Divida os textos em palavras individuais (tokens). Bibliotecas como NLTK ou spaCy podem ser \u00fateis.<\/p>\n<p><!-- \/wp:heading --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-27d266a7 elementor-widget elementor-widget-text-editor\" data-id=\"27d266a7\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<div class=\"wp-block-uagb-container uagb-block-e656d9d1 alignfull uagb-is-root-container\">\n<div class=\"uagb-container-inner-blocks-wrap\">\n<p><!-- wp:heading {\"level\":1} \/--><!-- wp:paragraph \/--><!-- wp:paragraph \/--><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:heading {\"level\":1} --><\/p>\n<p><!-- \/wp:heading --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:heading {\"level\":1} --><\/p>\n<h1>\u00a0<\/h1>\n<h1>\u00a0<\/h1>\n<p>\u00a0<\/p>\n<p><!-- \/wp:heading --><!-- wp:code {\"style\":{\"elements\":{\"link\":{\"color\":{\"text\":\"var:preset|color|white\"}}}},\"backgroundColor\":\"black\",\"textColor\":\"white\"} --><\/p>\n<pre class=\"wp-block-code has-white-color has-black-background-color has-text-color has-background has-link-color\"><code>def preprocess_text(text):\n    tokens = word_tokenize(text)\n    tokens = [word.lower() for word in tokens if word.isalpha()]\n    tokens = [word for word in tokens if word not in stop_words]\n    return tokens<\/code><\/pre>\n<p><!-- \/wp:code --><!-- wp:paragraph --><\/p>\n<p>4.1.2 Remo\u00e7\u00e3o de Stop Words:Elimine palavras comuns que n\u00e3o contribuem para a an\u00e1lise, como &#8220;a&#8221;, &#8220;de&#8221;, etc.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:code {\"style\":{\"elements\":{\"link\":{\"color\":{\"text\":\"var:preset|color|white\"}}}},\"backgroundColor\":\"black\",\"textColor\":\"white\"} --><\/p>\n<pre class=\"wp-block-code has-white-color has-black-background-color has-text-color has-background has-link-color\"><code>from nltk.corpus import stopwords\ntokens = [word for word in tokens if word not in stop_words]\n<\/code><\/pre>\n<p><!-- \/wp:code --><!-- wp:paragraph --><\/p>\n<p>4.2 Representa\u00e7\u00e3o de Texto:<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>Escolha como representar o texto, considerando a natureza da tarefa.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>4.3 Aplica\u00e7\u00e3o de Algoritmos:<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>Escolha e aplique os algoritmos de acordo com a tarefa espec\u00edfica.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>4.4 Avalia\u00e7\u00e3o e Ajuste:<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>Avalie o desempenho do modelo e ajuste conforme necess\u00e1rio.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:code {\"style\":{\"elements\":{\"link\":{\"color\":{\"text\":\"var:preset|color|white\"}}}},\"backgroundColor\":\"black\",\"textColor\":\"white\"} --><\/p>\n<pre class=\"wp-block-code has-white-color has-black-background-color has-text-color has-background has-link-color\"><code>import pandas as pd\nfrom nltk.tokenize import word_tokenize\nfrom nltk.corpus import stopwords\nfrom keras.preprocessing.text import Tokenizer\nfrom keras.preprocessing.sequence import pad_sequences\nfrom keras.models import Sequential\nfrom keras.layers import Embedding, LSTM, Dense\nfrom sklearn.model_selection import train_test_split\nfrom sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score\n# Carregar o conjunto de dados do arquivo local\ndf = pd.read_csv('bible.csv')\n# Pr\u00e9-processamento do texto\nstop_words = set(stopwords.words('english'))\ndef preprocess_text(text):\n    tokens = word_tokenize(text)\n    tokens = [word.lower() for word in tokens if word.isalpha()]\n    tokens = [word for word in tokens if word not in stop_words]\n    return ' '.join(tokens)  # Junte os tokens de volta em uma string\n# Aplicar pr\u00e9-processamento ao texto\ndf['Bible'] = df['Bible'].apply(preprocess_text)\n# Tokeniza\u00e7\u00e3o usando Keras Tokenizer\nmax_words = 1000\ntokenizer = Tokenizer(num_words=max_words, oov_token=\"&lt;OOV&gt;\")\ntokenizer.fit_on_texts(df['Bible'])\n# Convertendo texto para sequ\u00eancias de n\u00fameros\nsequences = tokenizer.texts_to_sequences(df['Bible'])\n# Padding para garantir que todas as sequ\u00eancias tenham o mesmo comprimento\nX = pad_sequences(sequences)\n# Modelo LSTM com hiperpar\u00e2metros ajust\u00e1veis\nmodel = Sequential()\nmodel.add(Embedding(input_dim=max_words, output_dim=32, input_length=X.shape[1]))\nmodel.add(LSTM(units=50, dropout=0.2, recurrent_dropout=0.2))  # Ajuste o n\u00famero de unidades e a taxa de dropout conforme necess\u00e1rio\nmodel.add(Dense(1, activation='sigmoid'))\nmodel.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])\n# Contagem da palavra 'lord'\ncount_lord = df['Bible'].str.count('lord').sum()\nprint(f'A palavra \"lord\" aparece {count_lord} vezes no texto da B\u00edblia.')\n<\/code><\/pre>\n<p><!-- \/wp:code --><!-- wp:heading {\"level\":1} --><\/p>\n<h1 class=\"wp-block-heading\"><strong>Cap\u00edtulo 5: Tutorial de Ferramentas Espec\u00edficas:<\/strong><\/h1>\n<p><!-- \/wp:heading --><!-- wp:paragraph --><\/p>\n<p>NLTK Tutorial: <a href=\"https:\/\/www.nltk.org\/book\/\" target=\"_blank\" rel=\"noreferrer noopener\">Natural Language Processing with Python &#8211; NLTK<\/a><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>Scikit-Learn Tutorial: <a href=\"https:\/\/scikit-learn.org\/stable\/modules\/feature_extraction.html#text-feature-extraction\" target=\"_blank\" rel=\"noreferrer noopener\">Text feature extraction<\/a><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>TensorFlow Tutorial: <a href=\"https:\/\/www.tensorflow.org\/tutorials\/keras\/text_classification\" target=\"_blank\" rel=\"noreferrer noopener\">Text Classification with TensorFlow<\/a><\/p>\n<p><!-- \/wp:paragraph --><!-- wp:heading {\"level\":1} --><\/p>\n<h1 class=\"wp-block-heading\"><strong>Cap\u00edtulo 6. Refer\u00eancias:<\/strong><\/h1>\n<p><!-- \/wp:heading --><!-- wp:paragraph --><\/p>\n<p>Livro: &#8220;Mining the Social Web&#8221; de Matthew A. Russell<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><\/p>\n<p>Livro: &#8220;Natural Language Processing in Action&#8221; de Lane, Howard, e HapkeSite: <a href=\"https:\/\/towardsdatascience.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">Towards Data Science<\/a><\/p>\n<\/div>\n<\/div>\n<p><!-- \/wp:paragraph --><!-- \/wp:uagb\/container --><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-7726b50 elementor-widget elementor-widget-text-editor\" data-id=\"7726b50\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>\u00a0<\/p>\n<p>\u00a0<\/p>\n<p>Na era da informa\u00e7\u00e3o, onde grandes volumes de dados s\u00e3o gerados a cada segundo, a minera\u00e7\u00e3o de texto torna-se uma ferramenta indispens\u00e1vel. Seja para entender o comportamento do consumidor, analisar tend\u00eancias de mercado ou automatizar processos de tomada de decis\u00e3o, essa disciplina desempenha um papel crucial na transforma\u00e7\u00e3o de dados textuais aparentemente ca\u00f3ticos em insights acion\u00e1veis.<\/p>\n<p>1.4 Rela\u00e7\u00e3o com Processamento de Linguagem Natural (PLN):<\/p>\n<p>A minera\u00e7\u00e3o de texto est\u00e1 intimamente associada ao Processamento de Linguagem Natural (PLN), uma sub\u00e1rea da intelig\u00eancia artificial dedicada \u00e0 intera\u00e7\u00e3o entre computadores e linguagem humana. Ao incorporar t\u00e9cnicas PLN, a minera\u00e7\u00e3o de texto consegue lidar com a complexidade da linguagem natural, incluindo ambiguidades sem\u00e2nticas, varia\u00e7\u00f5es sint\u00e1ticas e nuances contextuais.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:heading {\"level\":1} --><\/p>\n<h1 class=\"wp-block-heading\">\u00a0<\/h1>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-af67ffb elementor-widget elementor-widget-text-editor\" data-id=\"af67ffb\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>\u00a0<\/p>\n<p>\u00a0<\/p>\n<p>Na era da informa\u00e7\u00e3o, onde grandes volumes de dados s\u00e3o gerados a cada segundo, a minera\u00e7\u00e3o de texto torna-se uma ferramenta indispens\u00e1vel. Seja para entender o comportamento do consumidor, analisar tend\u00eancias de mercado ou automatizar processos de tomada de decis\u00e3o, essa disciplina desempenha um papel crucial na transforma\u00e7\u00e3o de dados textuais aparentemente ca\u00f3ticos em insights acion\u00e1veis.<\/p>\n<p>1.4 Rela\u00e7\u00e3o com Processamento de Linguagem Natural (PLN):<\/p>\n<p>A minera\u00e7\u00e3o de texto est\u00e1 intimamente associada ao Processamento de Linguagem Natural (PLN), uma sub\u00e1rea da intelig\u00eancia artificial dedicada \u00e0 intera\u00e7\u00e3o entre computadores e linguagem humana. Ao incorporar t\u00e9cnicas PLN, a minera\u00e7\u00e3o de texto consegue lidar com a complexidade da linguagem natural, incluindo ambiguidades sem\u00e2nticas, varia\u00e7\u00f5es sint\u00e1ticas e nuances contextuais.<\/p>\n<p><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:paragraph --><!-- \/wp:paragraph --><!-- wp:heading {\"level\":1} --><\/p>\n<h1 class=\"wp-block-heading\">\u00a0<\/h1>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9d91d88 elementor-widget elementor-widget-heading\" data-id=\"9d91d88\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">3. Modelos na Minera\u00e7\u00e3o de Texto: Navegando pelos Caminhos da Representa\u00e7\u00e3o Textual<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>1. O que \u00e9 Minera\u00e7\u00e3o de Texto? A minera\u00e7\u00e3o de texto, como disciplina inovadora e vital no campo da ci\u00eancia de dados, est\u00e1 intrinsecamente ligada [&hellip;]<\/p>\n","protected":false},"author":113,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"elementor_theme","meta":{"footnotes":""},"class_list":["post-50","page","type-page","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Minera\u00e7\u00e3o de Texto - REVISA<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/wordpress.ft.unicamp.br\/revisa\/mineracao-de-texto\/\" \/>\n<meta property=\"og:locale\" content=\"pt_BR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Minera\u00e7\u00e3o de Texto - REVISA\" \/>\n<meta property=\"og:description\" content=\"1. O que \u00e9 Minera\u00e7\u00e3o de Texto? A minera\u00e7\u00e3o de texto, como disciplina inovadora e vital no campo da ci\u00eancia de dados, est\u00e1 intrinsecamente ligada [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/wordpress.ft.unicamp.br\/revisa\/mineracao-de-texto\/\" \/>\n<meta property=\"og:site_name\" content=\"REVISA\" \/>\n<meta property=\"article:modified_time\" content=\"2025-11-18T21:53:16+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. tempo de leitura\" \/>\n\t<meta name=\"twitter:data1\" content=\"8 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/wordpress.ft.unicamp.br\\\/revisa\\\/mineracao-de-texto\\\/\",\"url\":\"https:\\\/\\\/wordpress.ft.unicamp.br\\\/revisa\\\/mineracao-de-texto\\\/\",\"name\":\"Minera\u00e7\u00e3o de Texto - REVISA\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/wordpress.ft.unicamp.br\\\/revisa\\\/#website\"},\"datePublished\":\"2023-11-27T00:50:26+00:00\",\"dateModified\":\"2025-11-18T21:53:16+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/wordpress.ft.unicamp.br\\\/revisa\\\/mineracao-de-texto\\\/#breadcrumb\"},\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/wordpress.ft.unicamp.br\\\/revisa\\\/mineracao-de-texto\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/wordpress.ft.unicamp.br\\\/revisa\\\/mineracao-de-texto\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/wordpress.ft.unicamp.br\\\/revisa\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Minera\u00e7\u00e3o de Texto\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/wordpress.ft.unicamp.br\\\/revisa\\\/#website\",\"url\":\"https:\\\/\\\/wordpress.ft.unicamp.br\\\/revisa\\\/\",\"name\":\"REVISA\",\"description\":\"A REvista VIrtual de Sistemas de Apoio a decis\u00e3o \u00e9 escrita por alunos de Sistemas de Apoio \u00e0 Decis\u00e3o da FT\\\/UNICAMP e estende a excel\u00eancia da gradua\u00e7\u00e3o para toda comunidade.\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/wordpress.ft.unicamp.br\\\/revisa\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"pt-BR\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Minera\u00e7\u00e3o de Texto - REVISA","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/wordpress.ft.unicamp.br\/revisa\/mineracao-de-texto\/","og_locale":"pt_BR","og_type":"article","og_title":"Minera\u00e7\u00e3o de Texto - REVISA","og_description":"1. O que \u00e9 Minera\u00e7\u00e3o de Texto? A minera\u00e7\u00e3o de texto, como disciplina inovadora e vital no campo da ci\u00eancia de dados, est\u00e1 intrinsecamente ligada [&hellip;]","og_url":"https:\/\/wordpress.ft.unicamp.br\/revisa\/mineracao-de-texto\/","og_site_name":"REVISA","article_modified_time":"2025-11-18T21:53:16+00:00","twitter_card":"summary_large_image","twitter_misc":{"Est. tempo de leitura":"8 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/wordpress.ft.unicamp.br\/revisa\/mineracao-de-texto\/","url":"https:\/\/wordpress.ft.unicamp.br\/revisa\/mineracao-de-texto\/","name":"Minera\u00e7\u00e3o de Texto - REVISA","isPartOf":{"@id":"https:\/\/wordpress.ft.unicamp.br\/revisa\/#website"},"datePublished":"2023-11-27T00:50:26+00:00","dateModified":"2025-11-18T21:53:16+00:00","breadcrumb":{"@id":"https:\/\/wordpress.ft.unicamp.br\/revisa\/mineracao-de-texto\/#breadcrumb"},"inLanguage":"pt-BR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/wordpress.ft.unicamp.br\/revisa\/mineracao-de-texto\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/wordpress.ft.unicamp.br\/revisa\/mineracao-de-texto\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/wordpress.ft.unicamp.br\/revisa\/"},{"@type":"ListItem","position":2,"name":"Minera\u00e7\u00e3o de Texto"}]},{"@type":"WebSite","@id":"https:\/\/wordpress.ft.unicamp.br\/revisa\/#website","url":"https:\/\/wordpress.ft.unicamp.br\/revisa\/","name":"REVISA","description":"A REvista VIrtual de Sistemas de Apoio a decis\u00e3o \u00e9 escrita por alunos de Sistemas de Apoio \u00e0 Decis\u00e3o da FT\/UNICAMP e estende a excel\u00eancia da gradua\u00e7\u00e3o para toda comunidade.","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/wordpress.ft.unicamp.br\/revisa\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"pt-BR"}]}},"_links":{"self":[{"href":"https:\/\/wordpress.ft.unicamp.br\/revisa\/wp-json\/wp\/v2\/pages\/50","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wordpress.ft.unicamp.br\/revisa\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/wordpress.ft.unicamp.br\/revisa\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/wordpress.ft.unicamp.br\/revisa\/wp-json\/wp\/v2\/users\/113"}],"replies":[{"embeddable":true,"href":"https:\/\/wordpress.ft.unicamp.br\/revisa\/wp-json\/wp\/v2\/comments?post=50"}],"version-history":[{"count":20,"href":"https:\/\/wordpress.ft.unicamp.br\/revisa\/wp-json\/wp\/v2\/pages\/50\/revisions"}],"predecessor-version":[{"id":716,"href":"https:\/\/wordpress.ft.unicamp.br\/revisa\/wp-json\/wp\/v2\/pages\/50\/revisions\/716"}],"wp:attachment":[{"href":"https:\/\/wordpress.ft.unicamp.br\/revisa\/wp-json\/wp\/v2\/media?parent=50"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}