Doutorado em Ciência da Computação UFPI/UFMA

Doutorado em Ciência da Computação UFPI/UFMA http://hdl.handle.net/123456789/3648 Doutorado em Ciência da Computação UFPI/UFMA Tue, 21 Apr 2026 13:24:49 GMT 2026-04-21T13:24:49Z DETECÇÃO DE CORRELAÇÕES ESPÚRIAS COM INTELIGÊNCIA ARTIFICIAL EXPLICÁVEL http://hdl.handle.net/123456789/4111 DETECÇÃO DE CORRELAÇÕES ESPÚRIAS COM INTELIGÊNCIA ARTIFICIAL EXPLICÁVEL SOARES, Hélcio de Abreu RESUMO: Apesar dos avanços em Inteligência Artificial (IA), modelos de Machine Learning e Deep Learning ainda carecem de transparência e explicabilidade, sendo tratados como “caixas-pretas”. Este trabalho aborda o problema das correlações espúrias — associações entre padrões e classes sem relação causal — que, em tarefas de classificação binária em Processamento de Linguagem Natural (PLN), comprometem a precisão, a imparcialidade e a generalização dos modelos. Propomos um método que combina técnicas de Inteligência Artificial Explicável (XAI) e aprendizado não supervisionado para identificar e graduar padrões espúrios. Utilizando o algoritmo K-means, os padrões são agrupados e analisados pela distância aos centroides, sob a hipótese de que distâncias maiores indicam maior grau de espuriedade. A abordagem considera a influência desses padrões sobre explicadores e sua associação com erros de previsão. A metodologia é aplicada a dados de licitações e contratos do Tribunal de Contas do Estado do Piauí (TCE-PI), usando modelos baseados em Support Vector Machine (SVM), Logistic Regression (LR) com representações textuais TF-IDF e Word Embeddings, e o modelo BERTimbau, como codificador e classificador com embeddings contextuais dinâmicos. Aplicamos também o método ao IMDB para avaliar generalização e compará-lo com métodos de referências. Os resultados confirmam a hipótese e mostram consistência entre modelos e bases. As principais contribuições incluem: (i) método agnóstico a modelos e explicadores; (ii) detecção automática de padrões espúrios; (iii) uma métrica de espuriedade baseada na distância ao centroide; e (iv) organização lógica e interpretável dos padrões, ampliando a compreensão dos modelos e apoiando a mitigação de padrões espúrios. ABSTRACT: Despite advances in Artificial Intelligence (AI), Machine Learning and Deep Learning models still lack transparency and explainability, often being regarded as “black boxes.” This dissertation addresses the issue of spurious correlations—associations between patterns and classes that lack causal relationships—which, in binary classification tasks in Natural Language Processing (NLP), undermine model accuracy, fairness, and generalization. We propose a method that combines Explainable Artificial Intelligence (XAI) techniques with unsupervised learning to identify and rank spurious patterns. Using the K-means algorithm, patterns are clustered and evaluated based on their distance from centroids under the hypothesis that greater distances indicate higher degrees of spuriousness. The approach accounts for the influence of these patterns on explainers and their association with prediction errors. The methodology is applied to procurement and contract data from the Court of Auditors of the State of Piauí (TCE-PI), using Support Vector Machines (SVM), Logistic Regression with TF-IDF and Word Embedding text representations, and the BERTimbau model, both as encoder and classifier with dynamic contextual embeddings. The method is also applied to the IMDB dataset to evaluate generalization and compare it against reference methods. The results confirm the hypothesis and reveal consistent patterns across models and datasets. The main contributions include: (i) a model- and explainer-agnostic method; (ii) automatic detection of spurious patterns; (iii) a spuriousness metric based on centroid distance; and (iv) logical and interpretable organization of patterns, enhancing model understanding and supporting the mitigation of spurious correlations. Orientador: Prof. Dr. Rodrigo de Melo Souza Veras Co-orientador: Prof. Dr. Anselmo Cardoso de Paiva - UFMA Examinador externo: Prof. Dr. Ajalmar Rego da Rocha Neto - IFC Examinador externo: Prof. Dr. Gustavo Paiva Guedes e Silva - CEFET/RJ Examinador interno: Prof. Dr. Vinícius Ponte Machado Thu, 27 Nov 2025 00:00:00 GMT http://hdl.handle.net/123456789/4111 2025-11-27T00:00:00Z XRaySwinGen: pré-laudos médicos automáticos para exames de Raio - X de Tórax com modelo multimodal http://hdl.handle.net/123456789/4110 XRaySwinGen: pré-laudos médicos automáticos para exames de Raio - X de Tórax com modelo multimodal MAGALHÃES JUNIOR, Gilvan Veras RESUMO: A radiologia tem papel crucial na medicina moderna ao fornecer diagnósticos precisos por meio de imagens não invasivas. Entretanto, a elaboração manual de laudos médicos é um processo demorado e sujeito a falhas humanas. Esta tese propõe um modelo multimodal para a geração automática de pré-laudos médicos a partir de radiografias de tórax, combinando técnicas de Visão Computacional e Processamento de Linguagem Natural com base na arquitetura Transformer. Inicialmente, foi desenvolvida uma abordagem com codificador visual baseado no Swin Transformer e decodificador textual integrando camadas de atenção cruzada e treinamento bilíngue com conjuntos de dados em Português PT-BR ou Inglês. Posteriormente, a arquitetura foi aprimorada com a introdução de um módulo de memória relacional, permitindo a retenção de informações contextuais de longo prazo durante a geração dos textos. O modelo final integra de forma coesa os componentes visuais e textuais por meio de normalização condicional orientada à memória. Os experimentos, realizados nas bases de imagens Proposta, IU Chest X-ray, NIH Chest X-ray e MIMIC-CXR-JPG, evidenciaram ganhos consistentes. Na avaliação com o conjunto de dados completo da MIMIC-CXR-JPG, o modelo com Swin Transformer e decodificador textual atingiu ROUGE-L de 0.304, METEOR de 0.233 e BLEU-4 de 0.054. A inclusão da memória relacional elevou essas métricas para 0.321, 0.281 e 0.114, respectivamente. Na versão do mesmo conjunto de dados sem o histórico clínico, o desempenho do modelo com memória relacional foi ainda maior, alcançando ROUGE-L de 0.416, METEOR de 0.384 e BLEU-4 de 0.187. A melhora consistente das métricas com a adição do módulo de memória relacional demonstra o impacto positivo da retenção de contexto de longo prazo na qualidade textual dos pré-laudos gerados. Esses resultados reforçam a relevância do modelo proposto e motivam sua adoção em cenários clínicos que demandam precisão, fluidez e confiabilidade na geração automática de relatórios médicos. ABSTRACT: Radiology plays a crucial role in modern medicine by providing accurate diagnoses through non-invasive imaging. However, the manual creation of medical reports is a time-consuming process and prone to human error. This thesis proposes a multimodal model for the automatic generation of preliminary medical reports from chest radiographs, combining Computer Vision and Natural Language Processing techniques based on the Transformer architecture. Initially, an approach was developed with a visual encoder based on the Swin Transformer and a textual decoder integrating cross-attention layers and bilingual training with datasets in Brazilian Portuguese (PT-BR) or English. Subsequently, the architecture was enhanced with the introduction of a relational memory module, enabling the retention of long-term contextual information during text generation. The final model cohesively integrates the visual and textual components through memory-oriented conditional normalization. The experiments, conducted on the Proposta, IU Chest X-ray, NIH Chest X-ray, and MIMIC-CXR-JPG image datasets, showed consistent gains. In the evaluation with the complete MIMIC-CXR-JPG dataset, the model with the Swin Transformer and textual decoder achieved a ROUGE-L of 0.304, a METEOR of 0.233, and a BLEU-4 of 0.054. The inclusion of the relational memory module raised these metrics to 0.321, 0.281, and 0.114, respectively. In the version of the same dataset without clinical history, the performance of the model with relational memory was even higher, reaching a ROUGE-L of 0.416, a METEOR of 0.384, and a BLEU-4 of 0.187. The consistent improvement in metrics with the addition of the relational memory module demonstrates the positive impact of long-term context retention on the textual quality of the generated pre-reports. These results reinforce the relevance of the proposed model and encourage its adoption in clinical scenarios that demand accuracy, fluidity, and reliability in the automatic generation of medical reports. Orientador: Prof. Dr. Pedro de Alcântara dos Santos Neto Co-orientador: Prof. Dr. Anselmo Cardoso de Paiva Examinador externo: Prof. Dr. António Manuel Trigueiros da Silva Cunha - Universidade de Trás-os-Montes e Alto Douro Examinador externo: Prof. Dr. Cláudio de Souza Baptista - UFCG Examinador interno: Prof. Dr. Rodrigo De Melo Souza Veras Examinador interno: Prof. Dr. Kelson Romulo Teixeira Aires Thu, 27 Nov 2025 00:00:00 GMT http://hdl.handle.net/123456789/4110 2025-11-27T00:00:00Z SOURCE CODE EXPERTISE: Improving Knowledge Models and Assessing Generative AI Impact http://hdl.handle.net/123456789/4059 SOURCE CODE EXPERTISE: Improving Knowledge Models and Assessing Generative AI Impact CASTRO, Otávio Cury da Costa Abstract: Identifying developer expertise in source code is valuable in various Software Engineering contexts. Knowledgeable developers are best suited to perform tasks such as code review and onboarding. Numerous models have been proposed to estimate source code knowledge, making it a well-explored topic; however, important gaps remain that affect the accuracy and applicability of these models. Moreover, the increasing use of Generative Artificial Intelligence (GenAI) tools may influence how code expertise is acquired and measured. This study aims to develop more accurate models for identifying source code experts. We first investigate the correlation between development history variables and developers’ knowledge of source code files. We extract metrics from public and private repositories and survey developers about the files they contributed to. Based on these data, we propose a linear model and train machine learning classifiers, comparing their performance with existing models. We also apply the proposed models to the Truck Factor (TF) metric to assess their practical implications in identifying critical developers. To examine the impact of GenAI, we build a dataset combining code expertise metrics with information on ChatGPT-generated code integrated into open-source projects. We simulate different usage scenarios by assigning a portion of contributions to GenAI instead of developers and survey developers about their perception of GenAI’s effects on code comprehension. Our results show that First Authorship and Recency of Modification are the variables most strongly correlated with source code knowledge. The proposed machine learning models outperform linear baselines, achieving F-scores between 71% and 73%. When applied to the TF algorithm, they improved developer identification, reaching a best average F-score of 74%. GenAI usage negatively affected TF reliability, even in low proportions. Developers reported mixed perceptions, with concerns, especially about use by novice programmers. Orientador: Guilherme Amaral Avelino Co-orientador: Prof. Dr. Pedro de Alcantara dos Santos Neto Examinador interno: Prof. Dr. Vinicius Ponte Machado Examinador interno: Prof. Dr. Romuere Rodrigues Veloso e Silva Examinador externo: Prof. Dr. Lincoln Souza Rocha Examinador externo: Prof. Dr. André Cavalcante Hora Tue, 16 Sep 2025 00:00:00 GMT http://hdl.handle.net/123456789/4059 2025-09-16T00:00:00Z MELHORAMENTO NA CLASSIFICAÇÃO DE PÓLEN USANDO REDE NEURAL HÍBRIDA COM MECANISMO DE ATENÇÃO E SEPARAÇÃO POR VISTAS: uma abordagem Equatorial e Polar http://hdl.handle.net/123456789/4058 MELHORAMENTO NA CLASSIFICAÇÃO DE PÓLEN USANDO REDE NEURAL HÍBRIDA COM MECANISMO DE ATENÇÃO E SEPARAÇÃO POR VISTAS: uma abordagem Equatorial e Polar SOARES, Júlio César da Silva Resumo: A pesquisa com grãos de pólen tem aplicações em áreas como ecologia, controle de alergias e rastreamento de alimentos. No entanto, a classificação desses grãos enfrenta desafios significativos devido à limitação dos dados disponíveis e à variabili- dade das características morfológicas. Recentemente, a aplicação de Redes Neurais Convolucionais (CNNs) trouxe avanços expressivos nesse campo, com técnicas como transferência de aprendizado e aumento de dados sendo utilizadas para melhorar os resultados. Este estudo visa inovar na classificação de imagens de grãos de pólen ao considerar as diferenças entre as vistas equatorial e polar. O objetivo central é avaliar o impacto dessas vistas na tarefa de classificação, partindo da hipótese de que a vista polar, por revelar detalhes mais precisos do que a equatorial, pode proporcionar um desempenho superior. Assim, ao separar os grãos de pólen com base nas vistas, espera-se obter resultados que igualem ou superem os reportados na literatura, contribuindo de maneira original para o avanço do estado da arte. A pesquisa foi estruturada em três etapas interdependentes. Na primeira etapa, as bases de dados foram classificadas em seu formato original, empregando redes pré- treinadas e redes baseadas em mecanismos de atenção, com treinamento iniciado do zero. A segunda etapa focou na separação das bases em vistas equatorial e polar, utilizando técnicas de aprendizado semi-supervisionado para garantir uma divisão precisa. Na terceira e última etapa, as novas bases foram classificadas utilizando as redes que apresentaram o melhor desempenho na etapa inicial, permitindo uma avaliação comparativa entre as vistas. Os resultados preliminares demonstram que as redes pré-treinadas, particularmente a DenseNet201, alcançaram melhorias substanciais ao utilizar a base CPD1 dividida por vistas. A vista polar obteve as melhores métricas, com uma acurácia de 99.10%, superando as pesquisas anteriores que utilizaram a mesma base de dados CPD1, confirmando a hipótese inicial e destacando a relevância da separação por vistas. Abstract: Research on pollen grains has applications in areas such as ecology, allergy control, and food traceability. However, the classification of these grains faces significant challenges due to the limited availability of data and the variability of morphological characteristics. Recently, the application of Convolutional Neural Networks (CNNs) has led to signif- icant advancements in this field, with techniques such as transfer learning and data augmentation being employed to improve results. This study aims to innovate in the classification of pollen grain images by considering the differences between equatorial and polar views. The central objective is to assess the impact of these views on the classification task, based on the hypothesis that the polar view, by revealing more precise details than the equatorial view, can provide superior performance. Thus, by separating pollen grains based on these views, it is expected to achieve results that match or exceed those reported in the literature, contributing originally to the advancement of the state of the art. The research was structured into three interdependent stages. In the first stage, the datasets were classified in their original format, employing pre-trained networks and attention-based networks with training initiated from scratch. The second stage focused on separating the datasets into equatorial and polar views, using semi-supervised learning techniques to ensure accurate division. In the third and final stage, the newly generated datasets were classified using the networks that performed best in the initial stage, allowing for a comparative evaluation between the views. Preliminary results show that pre-trained networks, particularly DenseNet201, achieved substantial improvements when using the CPD1 dataset divided by views. The polar view achieved the best metrics, with an accuracy of 99.1%, surpassing previous studies that used the same CPD1 dataset, confirming the initial hypothesis and highlighting the relevance of view separation. Orientador: Prof. Dr. Kelson Romulo Teixeira Aires Co-orientador: Prof. Dr. Rodrigo de Melo Souza Veras Examinador interno: Prof.º Dr. Vinicius Ponte Machado Examinador interno: Prof. Dr. Ivan Saraiva Silva Examinadora interna: Profa. Dra. Juliana do Nascimento Bendini Examinadora externa: Profa. Dra. Andrea Gomes Campos Bianchi - UFOP Examinador externo: Prof. Dr. Pedro Luiz de Paula - UTFPR Tue, 16 Sep 2025 00:00:00 GMT http://hdl.handle.net/123456789/4058 2025-09-16T00:00:00Z