UMA ABORDAGEM DE TÉCNICAS DE MACHINE LEARNING NÃO SUPERVISIONADAS PARA TRANSIÇÕES DE FASE CLÁSSICAS

UMA ABORDAGEM DE TÉCNICAS DE MACHINE LEARNING NÃO SUPERVISIONADAS PARA TRANSIÇÕES DE FASE CLÁSSICAS

CARVALHO, Danyella Oliveira de

URI: http://hdl.handle.net/123456789/2892

Date: 2022-10-05

Abstract:

RESUMO: Machine Learning (ML), ou Aprendizado de Máquina, é um subcampo da Inteligência Artificial, constituído por classes de algoritmos de aprendizagem, que operam de forma tal que se possa extrair informação e “aprender” de uma grande quantidade de dados. Estes algoritmos têm sido incorporados no âmbito das ciências físicas como auxiliares na identificação de padrões, como, por exemplo, no estudo de transições de fase. Tais técnicas mostram-se como uma alternativa na compreensão de problemas complexos. Porém, apesar dos esforços, o limite de aplicação dessas técnicas ainda não é inteiramente claro. Em vista disso, investigamos neste trabalho as transições de fase do modelo de Potts @-estados, nos casos @ = 3, 4 e 5, utilizando técnicas de ML não supervisionadas, a saber, Principal Component Analysis (PCA), K-Means Clustering e Topological Data Analysis (TDA). As duas primeiras são técnicas de redução de dimensionalidade e clusterização, respectivamente, enquanto a última usa propriedades de invariantes topológicos para separar dados semelhantes. Já a construção da base de dados se dá através de simulações de Monte Carlo, em faixas de temperaturas que contêm as temperaturas críticas do modelo. A análise desse toy model nos permite compreender detalhadamente essas técnicas no âmbito de Matéria Condensada e Mecânica Estatística. Utilizando a técnica PCA, foram identificados os pontos críticos com boa precisão em todos os casos (@ = 3, 4 e 5), através de parâmetros construídos a partir das componentes principais. Além disso, observamos que a formação de clusters para temperaturas abaixo da temperatura crítica nos fornece informações sobre as simetrias da Hamiltoniana. Também foram encontrados resultados concordantes para os valores de expoentes críticos, porém com fortes efeitos de tamanho finito. Continuando, usamos a técnica K-Means para analisar o comportamento dos clusters obtidos através de PCA sob variação de temperatura. Esta análise retornou os pontos críticos, numa aproximação para o limite termodinâmico, com erro relativo inferior a 1%, em comparação com os valores exatos do modelo de Potts. Por fim, com o método TDA, obtivemos os pontos críticos com grande precisão, além de notar que esta técnica possui poucos efeitos de tamanho finito. Em resumo, obtivemos os pontos críticos com boa precisão usando os três métodos, e esperamos que esse estudo detalhado possa esclarecer as características particulares das técnicas aqui expostas, facilitando eventuais utilizações em problemas mais desafiadores no futuro. ABSTRACT: Machine Learning (ML) is a subfield of Artificial Intelligence, consisting of a class of algorithms that are able to extract information – i.e. “learning” – from a large amount of data. These algorithms have been incorporated in the physical sciences as supporting tools to identify patterns, for instance, in the study of phase transitions. Indeed, such techniques are placed as promising methods to understand complex systems. Yet, despite the great efforts, the limits of their applications are not entirely clear. In view of this, in this work we investigate the phase transitions of the q-states Potts model, in the cases q = 3, 4, and 5, using unsupervised ML techniques, namely, the Principal Component Analysis (PCA), the K-Means Clustering and the Topological Data Analysis (TDA). The first two are concerned with dimensionality reduction and clustering techniques, respectively, while the last one uses topological invariant properties to separate data by similarity. The dataset is generated through Monte Carlo simulations, in ranges of temperature that contain the critical temperatures of the model. The analysis of such a toy model allows us to understand the details about the applications of these techniques within the scope of CondensedMatter Physics and Statistical Mechanics. Using the PCA technique, the critical points are identifiedwith good precision for all cases (q = 3, 4 and 5), through parameters constructed from the principal components. In addition, we notice that the formation of clusters at low-temperatures provides information about the symmetries of the Hamiltonian. Also, we investigate the critical exponents, whose results are in rough agreement with the exact ones, but with substantial finite-size effects. Moreover, we use the K-Means technique to analyze the behavior of the clusters obtained through PCA, under temperature variation. This analysis returned the critical points, in an approximation to the thermodynamic limit, with a relative error of less than 1%, compared to the exact values of the Potts model. Finally, with the TDA method, we obtain the critical points with very good precision, besides the fact that this technique has a weak dependency on the system size. In summary, the critical points are obtained with good precision using these three methods, and we expect that this detailed study may shed light on the features of these techniques when dealing with phase transitions, therefore paving the way to their usage in more challenging problems.

Description:

Orientador: Prof. Dr. José Pimentel de Lima Coorientador: Prof. Dr. Natanael de Carvalho Costa Examinador externo: Prof. Dr. Tiago Mendes Santos (University of Augsburg) Examinador interno: Prof. Dr. Eduardo da Costa Girão Examinador suplente: Prof. Dr. Francisco Welington de Sousa Lima

Show full item record