Vector Model for the Text Style Analysis

Authors

  • N.P. Darchuk Taras Shevchenko National University of Kyiv
  • I.V. Vasileva Taras Shevchenko National University of Kyiv
  • A.N. Vasilev Taras Shevchenko National University of Kyiv

DOI:

https://doi.org/10.15407/ujpe66.5.373

Keywords:

physics of complex systems, quantitative linguistics, vector, energy, distribution

Abstract

The application of physical approaches to the analysis of the authorial styles of Ukrainian writers has been considered. A model, where the literary styles are described in a multidimensional space with the help of unit vectors, is proposed. The numerical characteristic of the style is the scalar product of the corresponding vector and a vector that determines the general style for a group of authors. This parameter is shown to depend linearly on author’s rank. This behavior confi rms the hypothesis of joining the majority, according to which an author, when selecting his/her literary style, takes the style of his/her successful colleagues into account.

References

M. Tsizh, B. Novosyadlyj, Yu. Holovatch, N.I. Libeskind. Large-scale structures in the ΛCDM Universe: network analysis and machine learning. Month. Not. R. Astronom. Soc. 495, 1311 (2020). https://doi.org/10.1093/mnras/staa1030

Yu. Holovatch, M. Dudka, V. Blavatska, V. Palchykov, M. Krasnytska, O. Mryglod. Statistical physics of complex systems in the world and in Lviv. Zh. Fiz. Dosl. 22, 2801 (2018) (in Ukrainian). https://doi.org/10.30970/jps.22.2801

Yu. Holovatch, M. Dudka, V. Blavatska, V. Palchykov, M. Krasnytska, O. Mryglod. Statistical Physics of Complex

Systems. Preprint ICMP-17-06U (Institute for Condensed Matter Physics, Lviv, 2017) (in Ukrainian).

Y. Holovatch, V. Palchykov. Complex networks of words in fables. In: Maths Meets Myths: Complexity-Science Approaches to Folktales, Myths, Sagas, and Histories. Edited by R. Kenna, M. MacCarron, P. MacCarron (Springer, 2016). https://doi.org/10.1007/978-3-319-39445-9_9

Yu. Holovatch, R. Kenna, P. MacCarron, P. Sarkanych, N. Fedorak, J. Yose. Mathematics and myths: A quantitative approach to comparative mythology. Ukr. Modern. 27, 108 (2020) (in Ukrainian).

R. de Regt, C. von Ferber, Yu. Holovatch, M. Lebovka. Public transportation in UK viewed as a complex network. Transportmetrica A 15, 722, (2019).

https://doi.org/10.1080/23249935.2018.1529837

Yu. Holovatch, R. Kenna, S. Thurner. Complex systems: Physics beyond physics. Eur. J. Phys. 38, 023002 (2017) [arXiv: 1610.01002].

https://doi.org/10.1088/1361-6404/aa5a87

B. Berche, C. von Ferber, T. Holovatch, Yu. Holovatch. Transportation network stability: A case study of city

transit. Adv. Compl. Syst. 15, 1250063 (2012) [arXiv: 1201.5532].

https://doi.org/10.1142/S0219525912500634

F. Jovanovic, C. Schinckus. Econophysics: A new challenge for fi nancial economics. J. Hist. Econom. Thought 35, 319 (2012).

https://doi.org/10.1017/S1053837213000205

R. Mantegna, H. Stanley. An Introduction to Econophysics (Cambridge Univ. Press, 2000).

https://doi.org/10.1017/CBO9780511755767

C. Schinckus, F. Jovanovic. Towards a transdisciplinary econophysics. J. Econom. Method. 20, 164 (2013).

https://doi.org/10.1080/1350178X.2013.801561

D. Stauff er. A biased review of sociophysics. J. Stat. Phys. 151, 9 (2013) [arXiv: 1207.6178v1].

https://doi.org/10.1007/s10955-012-0604-9

C. Castellano, S. Fortunato, V. Loreto. Statistical physics of social dynamics. Rev. Mod. Phys. 81, 591 (2009) [arXiv:

https://doi.org/10.1103/RevModPhys.81.591

3256].

S. Galam. Sociophysics: A review of Galam models. Int. J. Mod. Phys. C 19, 409 (2008)[arXiv: 0803.1800].

https://doi.org/10.1142/S0129183108012297

O.M. Vasilev. A model of rumors spreading in the community with opportunistic behavior. Zh. Fiz. Dosl. 22, 3801 (2018) (in Ukrainian).

https://doi.org/10.30970/jps.22.3801

O.M. Vasilev, O.V. Chalyi. The modeling of macroeconomic dynamics by the methods of econophysics. Zh. Fiz. Dosl. 17, 4801 (2013) (in Ukrainian).

https://doi.org/10.30970/jps.17.4801

A. Rovenchak, S. Buk. Quantum distributions and research of texts: temperature and literature. Ukr. Modern. 27, 29 (2020) (in Ukrainian).

S.N. Buk, Yu. Krynytskyi, A. Rovenchak. Properties of autosemantic word networks in Ukrainian texts. Adv. Compl. Syst. 22, 1950016 (2019). https://doi.org/10.1142/S0219525919500164

A. Rovenchak, S. Buk. Part-of-speech sequences in literary text: Evidence from Ukrainian. J. Quantit. Linguist. 25, 1 (2017). https://doi.org/10.1080/09296174.2017.1324601

A.A. Rovenchak, S. Buk. Defi ning thermodynamic parameters for texts from word rank-frequency distributions. J. Phys. Stud. 15, 1005 (2011). https://doi.org/10.30970/jps.15.1005

A.A. Rovenchak, S. Buk. Application of a quantum ensemble model to linguistic analysis. Physica A 390, 1326 (2011) [arXiv: 1011.5076]. https://doi.org/10.1016/j.physa.2010.12.009

Yu. Holovatch, V. Palchykov. Fox Mykyta and networks of language. Zh. Fiz. Dosl. 11, 22 (2007).

https://doi.org/10.30970/jps.11.022

O. Vasilev, O. Chalyi, I. Vasileva. Mathematical methods and models in linguistics. Ukr. Modern. 27, 9 (2020) (in Ukrainian).

A.N. Vasilev, I.V. Vasileva. Physics beyond physics: Application of physical approaches in quantitative linguistics. Ukr. J. Phys. 65, 143 (2020).

https://doi.org/10.15407/ujpe65.2.143

A. Vasilev, I. Vasileva. Text length and vocabulary size: Case of the Ukrainian writer Ivan Franko. Glottometrics 43, 1 (2018).

A.N. Vasilev, A.V. Chalyi, I.V. Vasileva. About "exotic" problems of physics, Winnie the Pooh and Zipf's law. Zh. Fiz. Dosl. 17, 1001 (2013) (in Ukrainian).

https://doi.org/10.30970/jps.17.1001

O.M. Vasiliev, I.V. Vasilieva. Features of the creation of mathematical models in linguistics. Visn. Kherson. Nats. Tekhn. Univ. 69, 99 (2019) (in Ukrainian).

Yu. Tuldava. Problems and Methods of Quantitative-Systemic Research of Lexicon (Valgus, 1987) (in Russian).

R. Piotrovskii, K. Bektaev, A. Piotrovskaya. Mathematical Linguistics (Vysshaya Shkola, 1977) (in Russian).

V.V. Levitskii. Quantitative Methods in Linguistics (Ruta, 2005) (in Russian).

N. Darchuk, I. Denysenko, O. Siruk, V. Sorokin. Theoretical issues of modeling the ideographic thesaurus of the Ukrainian language. Ukr. Movozn. 24, 107 (2002) (in Ukrainian).

N.P. Darchuk, L.A. Aleksienko, V.M. Sorokin. Parameterized database of poetic speech as a source of philological studies. Mova Probl. Prykl. Lingvist. 9, 15 (2004) (in Ukrainian).

N.P. Darchuk. Poetic dictionary from the viewpoint of the world's linguistic picture. Ukr. Movozn. 35, 55 (2006) (in Ukrainian).

N.P. Darchuk. Research corpus of the Ukrainian language: Basic principles and prospects. Visn. Kyiv. Nats. Univ. Literat. Lingvist. Folklor. 21, 45, (2010) (in Ukrainian).

N.P. Darchuk. Automatic syntactic analysis of the texts in the corpus of the Ukrainian language. Ukr. Movozn. 43, 11 (2013) (in Ukrainian).

N.P. Darchuk. Semantics formalization directions. Movn. Kontsept. Kart. Svit. 46, 385 (2013) (in Ukrainian).

R.T. Grom'yak, Yu.I. Kovaliv, V.I. Teremko. Literary Dictionary-Reference Book (Akademiya, 1997) (in Ukrainian).

H.S. Green, C.A. Hurst. Order-Disorder Phenomena (Interscience, 1964).

B.M. McCoy, T.T. Wu. The Two-Dimensional Ising Model (Cambridge Univ. Press, 1973).

https://doi.org/10.4159/harvard.9780674180758

G. Salton, A. Wong, C.S. Yang. A vector space model for information retrieval. Commun. ACM 18, 613 (1975).

https://doi.org/10.1145/361219.361220

W. Pauli. General Principles of Quantum Mechanics (Springer, 1980).

https://doi.org/10.1007/978-3-642-61840-6

A.S. Davydov. Quantum Mechanics (Pergamon Press, 1976).

Th. Veblen. The Theory of the Leisure Class (Courier Corporation, 1994).

H.A. Simon. A Behavioral Model of Rational Choice. Quart. J. Econom. 69, 99 (1955).

https://doi.org/10.2307/1884852

N.P. Darchuk. Structural-statistical database of the modern Ukrainian language on the basis of frequency dictionaries. In Vocabulum et Vocabularium (Grognen. Gos. Univ., 2005) (in Russian).

L.A. Alekseenko, N.P. Darchuk, O.N. Zuban', V.V. Sorokin. Parameterized database of poetic speech as a source and an instrument for philological studies. In: Proceedings of the International Conference "Computer Linguistics without Borders" (St. Petersburg, 2004).

I. Vasileva. The application of computer thesaurus in the study of the poets' language. Leksykogr. Byulet. 13, 161 (2006) (in Ukrainian).

Shang-Keng Ma. Modern Theory of Critical Phenomena (Benjamin, 1976).

A.Z. Patashinskii, V.L. Pokrovskii. Fluctuation Theory of Phase Transitions (Pergamon Press, 1982).

Published

2021-05-28

How to Cite

Darchuk, N., Vasileva, I., & Vasilev, A. (2021). Vector Model for the Text Style Analysis. Ukrainian Journal of Physics, 66(5), 373. https://doi.org/10.15407/ujpe66.5.373

Issue

Section

General physics