Exploring the relationship between two compositions using canonical correlation analysis

Authors

  • Glòria Mateu-Figueras, University of Girona, Department of Computer Science, Applied Mathematics and Statistics, Girona, Spain
  • Josep Daunis-i-Estadella, University of Girona, Department of Computer Science, Applied Mathematics and Statistics, Girona, Spain
  • Germà Coenders, University of Girona, Department of Economics, Girona, Spain
  • Berta Ferrer-Rosell, University of Girona, Department of Economics, Girona, Spain
  • Ricard Serlavós, University Ramon Llull, Department of People Management and Organization, Barcelona, Spain
  • Joan Manuel Batista-Foguet, University Ramon Llull, Department of People Management and Organization, Barcelona, Spain

DOI:

https://doi.org/10.51936/epet8264

Abstract

This article describes a method for relating two compositions that combines compositional data analysis with canonical correlation analysis (CCA), and examines its main statistical properties. We apply the additive log-ratio (ALR) transformation to both compositions and then run standard CCA on the transformed data. We show that the canonical variates are themselves log-ratios and log-contrasts. The first pair of canonical variates can be interpreted as the log-contrast of one composition that has the maximum correlation with a log-contrast of the other composition. The second pair has the same interpretation, under the restriction that its variates are uncorrelated with those of the first pair, and so on for subsequent pairs. Using properties of changes of basis, we prove that both the canonical correlations and the canonical variates are invariant to the choice of divisors in the ALR transformation. We show how to implement the analysis and interpret the results through an illustration from the social sciences, using data from Kolb's Learning Style Inventory and Boyatzis' Philosophical Orientation Questionnaire, both of which distribute a fixed total score among several learning modes or philosophical orientations.
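
The procedure summarized in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the data are synthetic, the function names `alr` and `canonical_correlations` are hypothetical, and the canonical correlations are computed with a standard QR-plus-SVD approach, which is only one of several equivalent ways to run CCA. The final check illustrates the invariance property stated above: changing the ALR divisor leaves the canonical correlations unchanged.

```python
import numpy as np

def alr(X, divisor):
    """Additive log-ratio transform: log of every other part over the divisor part."""
    logX = np.log(np.asarray(X, dtype=float))
    others = np.delete(logX, divisor, axis=1)
    return others - logX[:, [divisor]]

def canonical_correlations(U, V):
    """Canonical correlations between column sets U and V.

    Centers both sets, orthonormalizes them with QR, and takes the
    singular values of the cross-product of the orthonormal bases.
    Assumes both centered matrices have full column rank.
    """
    Uc = U - U.mean(axis=0)
    Vc = V - V.mean(axis=0)
    Qu, _ = np.linalg.qr(Uc)
    Qv, _ = np.linalg.qr(Vc)
    return np.linalg.svd(Qu.T @ Qv, compute_uv=False)

# Synthetic data (an assumption for illustration only): a 4-part and a
# 3-part composition that share one latent factor z.
rng = np.random.default_rng(0)
n = 200
z = rng.normal(size=n)
A = np.exp(rng.normal(size=(n, 4)) + np.outer(z, [1.0, -0.5, 0.0, 0.3]))
B = np.exp(rng.normal(size=(n, 3)) + np.outer(z, [0.8, 0.0, -0.6]))
X = A / A.sum(axis=1, keepdims=True)  # close each row to sum 1
Y = B / B.sum(axis=1, keepdims=True)

# Same analysis with two different choices of ALR divisors.
rho_last = canonical_correlations(alr(X, 3), alr(Y, 2))
rho_first = canonical_correlations(alr(X, 0), alr(Y, 0))

print(rho_last)
print(rho_first)
print(np.allclose(rho_last, rho_first))
```

With a 4-part and a 3-part composition, the ALR-transformed data have 3 and 2 columns, so there are two canonical correlations. Because switching divisors is an invertible linear change of basis, both runs span the same column spaces and the two printed vectors agree up to floating-point error.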

Published

2024-12-10

Section

Articles