Next Article in Journal
Smart Architecture Energy Management through Dynamic Bin-Packing Algorithms on Cloud
Next Article in Special Issue
Quasi Association Models for Square Contingency Tables with Ordinal Categories
Previous Article in Journal
Fixed-Time Formation Control for Second-Order Disturbed Multi-Agent Systems under Directed Graph
Previous Article in Special Issue
A Generalized Two-Dimensional Index to Measure the Degree of Deviation from Double Symmetry in Square Contingency Tables
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Families of Generalized Quasisymmetry Models: A ϕ-Divergence Approach

Institute of Statistics, RWTH Aachen University, 52062 Aachen, Germany
Submission received: 26 October 2021 / Revised: 16 November 2021 / Accepted: 25 November 2021 / Published: 2 December 2021
(This article belongs to the Special Issue Advances in Quasi-Symmetry Models)

Abstract

:
The quasisymmetry ( Q S ) model for square contingency tables is revisited, highlighting properties and features on the basis of its alternative definitions. More parsimonious Q S -type models, such as the ordinal QS model for ordinal classification variables and models based on association models (AMs) with homogeneous row and column scores, are discussed. All these models are linked to the local odds ratios (LOR). Q S -type models and AMs were extended in the literature for generalized odds ratios other than LOR. Furthermore, in an information-theoretic context, they are expressed as distance models from a parsimonious reference model (the complete symmetry for Q S and the independence for AMs), while they satisfy closeness properties with respect to Kullback–Leibler (KL) divergence. Replacing the KL by ϕ divergence, flexible classes of Q S -type models for LOR, AMs for LOR, and AMs for generalized odds ratios were generated. However, special Q S -type models that are based on homogeneous AMs for LOR have not been extended to ϕ -divergence-based classes so far, or the Q S -type models for generalized odds ratios. In this work, we develop these missing extensions, and discuss Q S -type models and their generalizations in depth. These flexible families enrich the modeling options, leading to models of better fit and sound interpretation, as illustrated by representative examples.

1. Introduction

A special type of square contingency table that occurs often in studies of correlated or repeated categorical measurements, e.g., in panels or social mobility studies, is a table with row and column classification variables measured on the same scale, which can be nominal or ordinal. Caussinus, in his pioneering work [1], introduced quasisymmetry ( Q S ) models for such square contingency tables, focusing on its mathematical properties and its connection to the models of complete symmetry (S) and marginal homogeneity ( M H ). The interpretation aspects were developed later by [2,3,4], among others. The Q S model applies on contingency tables that have a common nominal classification scale. An ordinal Q S ( O Q S ) model was introduced by [5].
In this work, we focus on the Q S model, penetrating its features by considering the alternative equivalent definitions of Q S in terms of cell probabilities, local odds ratios (LOR), and as a model measuring departure from the S model. Furthermore, outgoing from the fact that, under Q S , the table of LOR is symmetric, we discuss more special parsimonious Q S models that are based on the family of association models (AMs) with homogeneous row and column scores ([3,6]). In a information-theoretic setup, Q S and O Q S models were generalized through ϕ -divergence to the corresponding families of models Q S ( ϕ ) and O Q S ( ϕ ) ([7,8]). Associated orthogonal decomposition properties of the S model were proved for the Q S ( ϕ ) and O Q S ( ϕ ) models, respectively, by [7,9]. Further variants of ϕ -divergence Q S -type models and S decomposition properties were discussed in [10,11,12]. A detailed literature review on Q S models and links to other models of symmetry and asymmetry can be found in [13]. Here, we introduce a ϕ -divergence family of special Q S models that roots on the ϕ -divergence family of AMs ([14]).
Moving from LOR to generalized odds ratios, e.g., global odds ratios (GOR), Q S models were also considered for modeling the symmetry of generalized odds ratios ([15]). AMs for generalized odds ratios ([16]) were extended to a broader family through the ϕ -divergence ([17]). Combining these two families of models discussed in [15,17], we introduce here new flexible classes of Q S models for generalized odds ratios that are based on ϕ -divergence.
Overall, we revisit the Q S model:
  • Aiming at an indepth discussion of its nature and properties, as consolidated by the alternative possible definitions of Q S , and consideration of special Q S -type models, with an emphasis on Q S -type models that are based on homogeneous AMs.
  • Reviewing extensions of Q S -type models towards two directions: (a) in an information-theoretic setup by replacing the role of KL divergence by ϕ -divergence, and (b) considering them for generalized odds ratios other than LOR.
  • Proposing a new ϕ -divergence family of Q S -type models by expanding the relation of Q S to homogeneous AMs for the ϕ -divergence AMs.
  • Introducing a flexible family of models by extending models of ii(b) above in terms of the ϕ -divergence.
The paper is structured as follows. Section 2 reviews Q S and O Q S models, focusing on their structural properties, and further discusses AMs with homogeneous scores that are of the Q S -type. Section 3 presents Q S and O Q S models for generalized odds ratios. Generalized families of Q S and O Q S models for LOR based on ϕ divergence are briefly reviewed in Section 4. In Section 5, the Q S -type models of Section 2.2 and the Q S models for generalized odds ratios of Section 3 are expanded to corresponding ϕ divergence-based classes of models by modeling ϕ -scaled generalized odds ratios. A selection of the discussed models is illustrated on two examples in Section 6. Section 7 discusses further possible models that can be investigated. Section 8 summarizes our results.

2. Quasisymmetry Models for Square Contingency Tables

Consider an I × I contingency table cross-classifying two categorical variables X and Y, measured on the same scale, and corresponding to the rows and columns of the table. Let π = ( π i j ) be the associated probability table with cell entry probabilities π i j = P ( X = i , Y = j ) , for i , j = 1 , , I , where π i j [ 0 , 1 ] and i , j π i j = 1 . Then, the Q S model, initially introduced by Caussinus [1], is expressed in log-linear form as
log ( π i j ) = u + u i X + u j X + α j + u i j X Y , i , j = 1 , , I ,
with symmetric interactions, i.e., with
u i j X Y = u j i X Y , i < j , i , j = 1 , , I .
Parameters in (1) satisfy some identifiability constraints. We set
u 1 X = α 1 = 0 and u 1 i X Y = u i 1 X Y = 0 , i = 1 , , I .
The degrees of freedom of (1) equal d f ( Q S ) = ( I 1 ) ( I 2 ) / 2 . The α j parameters measure the departure from model of complete symmetry S, under which π i j = π j i = π i j S , for all i < j . Indeed, Q S is reduced to S if α j = 0 , for all j. Model (1) fits the diagonal entries exactly.
In terms of cell probabilities, (1) is equivalently expressed as
π i j = π i j S 2 c i c i + c j , i , j = 1 , , I ,
with parameters c i providing insight into sources of marginal inhomogeneity, as discussed in [7,18].
Alternatively, the Q S model can be defined in terms of the LOR
θ i j L = π i j π i + 1 , j + 1 π i + 1 , j π i , j + 1 , i , j = 1 , , I 1 ,
i.e., the odds ratios of all 2 × 2 subtables formed by pairs of successive rows and successive columns. These form an ( I 1 ) × ( I 1 ) contingency table, and QS is the model that has symmetric LOR
θ i j L = θ j i L , i < j , i , j = 1 , , I 1 ,
and fits the diagonal entries of the probability table exactly, as indicated in [19]. This definition through the LOR highlights a basic structural property of Q S , facilitating its physical interpretation and enabling generalizations in new directions by considering alternative types of odds ratios, as we show in Section 3.

2.1. QS Model for Ordinal Classification Variables

Usually in applications of the Q S model, the classification scale is ordinal. However, Q S is also applicable for tables with nominal classification variables, since it fulfils the permutation invariance property ([20]). In the case of an ordinal scale, alternative Q S -type models are possible that are more parsimonious and provide insightful interpretation. Agresti [5] introduced the ordinal  Q S ( O Q S ) model
log ( π i j ) = u + u i X + u j X + β · j + u i j X Y , i , j = 1 , , I ,
with interaction parameters satisfying (2) and under (3). In other words, O Q S is a special, more parsimonious Q S model, derived from (1) when α j = β · j . It has just one parameter more than the S model; hence, d f ( O Q S ) = d f ( S ) 1 = I ( I 1 ) / 2 1 . Equivalently, (7) can be expressed as
π i j = π j i δ i j , i j , i , j = 1 , , I ,
with β = log ( δ ) , or as a departure from symmetry model
π i j = π i j S δ i δ i + δ j , i j , i , j = 1 , , I .
Under the O Q S model, scores are assigned to the classification categories that equal the category indices ( μ j = j , j = 1 , , I ). It can be easily verified that these scores in (7) and (8) can equivalently be replaced by any equally spaced scores, i.e., linear transformation of μ j s. More generally, one could consider an O Q S model for known but unequally spaced scores, that is, for any set of known scores μ 1 μ 2 μ I (with μ 1 < μ I ). This model however will no more be equivalent to (7). Analogously to the Q S model, O Q S reduces to the S model when β = 0 (or δ = 1 ).

2.2. Association Models with Homogeneous Scores

Log-linear models with interactions for two-way contingency tables are saturated. In case of a square table, the saturated log-linear model is given by
log ( π i j ) = u + u i X + u j Y + u i j X Y , i , j = 1 , , I .
Association models (AMs) impose special structures on the interactions, thus leading to nonsaturated dependence models of sound interpretation. They are also known as Goodman’s AMs (see [6] and references therein). For a detailed discussion of AMs and the associated literature, we refer to [21] (Chapter 6); a short presentation is provided in [22].
The simplest association model is that of uniform association (U) that is applied on tables with ordinal classification variables and model interactions through just one parameter of intrinsic association on the basis of equidistant scores assigned to the categories of the classification variables. The U model for square contingency tables (with classification variables of common scale) can be expressed as
log ( π i j ) = u + u i X + u j Y + ζ μ i μ j , i , j = 1 , , I ,
where ζ is the intrinsic association parameter and μ 1 < μ 2 < μ I are known scores assigned to the classification categories, which are homogeneous for rows and columns, and equidistant for successive categories ( μ i + 1 μ i = δ > 0 ). Under (11), interaction parameters u i j X Y = ζ μ i μ j are obviously symmetric. Under the U model, all LORs are equal since
log ( θ i j L ) = ζ ( μ i + 1 μ i ) ( μ j + 1 μ j ) = ζ δ 2 , i , j = 1 , , I 1 ,
and thus (6) is trivially fulfilled. However, (11) is not of the Q S -type since it does not exactly fit on the diagonal. Its extension
log ( π i j ) = u + u i X + u j Y + d i I ( i = j ) + ζ μ i μ j , i , j = 1 , , I ,
where I is the indicator function, is the homogeneous uniform association model with exactly fitted diagonal entries, denoted by U h d , introduced by [3] as the uniform with main diagonal deleted model. It is a quasisymmetric model for ordinal classifications, more parsimonious than Q S with d f ( U h d ) = ( I 1 ) 2 I 1 . It is an alternative to O Q S and more parsimonious than O Q S for I > 4 .
Analogously to the O Q S model, Model (13) can be considered for arbitrary known scores μ 1 μ 2 μ I (with μ 1 < μ I ). Furthermore, considering expression (13) with unknown parametric scores, not necessarily ordered, the homogeneous R C model with exactly fitted diagonals ( R C h d ) is derived, which is another Q S -type model, less parsimonious than U h d that can apply also to nominal classification variables. For a discussion on U h d , R C h d and further homogeneous AMs of higher order (i.e., homogeneous R C ( K ) models) and their links to Q S , we refer e.g., to [21] (Section 9.4).

3. Generalized QS Models

In case one or both of the classification variables are ordinal, there exist other types of odds ratios that are alternatives to LOR. In our framework of square tables with classification variables measured on the same scale, of interest are, beyond LOR, odds ratios for tables with ordinal classification scale. The most popular type for ordinal classification variables is the global odds ratios (GOR), which for an I × I contingency table are defined as
θ i j G = π i ̲ , j ̲ / π i ̲ , j ¯ π i ¯ , j ̲ / π i ¯ , j ¯ = ( s i , k j π s k ) / ( s i , k > j π s k ) ( s > i , k j π s k ) / ( s > i , k > j π s k ) , i , j = 1 , , I 1 .
The characterization global is because every θ i j G is based on the whole contingency table since it dichotomizes X and Y at levels i and j, respectively, and accordingly merges the cell probabilities. GOR treat both classification variables in a symmetric manner. When merging is considered only for one classification variable, for example, Y, while the other is treated locally, then the cumulative odds ratios (COR) are derived
θ i j C Y = π i , j ̲ / π i , j ¯ π i + 1 , j ̲ / π i + 1 , j ¯ = ( k j π i k ) / ( k > j π i k ) ( k j π i + 1 , k ) / ( k > j π i + 1 , k ) , i , j = 1 , , I 1 .
COR can be used in problems of modeling the effect of an explanatory variable on a response. In particular, θ i j C Y could be considered if Y is the response. Obviously, θ i j C X can be analogously defined. For further types of generalized odds ratios and their detailed study, their inter-relations, and associated positive dependence properties, we refer to [23].
Motivated by the definition of Q S through the symmetric LOR, the authors in [15] introduced generalized Q S -type models for generalized odds ratios other than the LOR. In this context, the classical Q S model that applies on the LOR would be denoted by Q S L , while analogously to (6), the Q S property for the GOR is defined by
θ i j G = θ j i G , i < j , i , j = 1 , , I 1 .
and denoted by Q S G . On the other hand, the definition of the Q S C model for the COR requires to change also the role of the response variable
θ i j C Y = θ j i C X , i < j , i , j = 1 , , I 1 ,
as explained in [15].

4. ϕ -Divergence-Based Families of Generalized QS Models

Model Q S and O Q S can be defined as departure from complete symmetry models (see (4) and (9)). From a statistical information point of view, they share a common property. Both, under certain conditions (different for each model), are the closest models to complete symmetry when the distance in measured in terms of the KL divergence, as proved by [7,8] for Q S and O Q S , respectively. Furthermore, the authors in [7,8] introduced and studied general classes of Q S and O Q S models, derived by replacing the KL divergence by a family of divergences, ϕ -divergence, which includes the KL as special case.
In particular, ϕ -divergence is an important generalized measure for measuring divergence between two probability distributions. In our setup, if π = ( π i j ) , q = ( q i j ) are two discrete finite I × I probability distributions, the ϕ -divergence between q and π is given by
I ϕ ( q , π ) = i , j π i j ϕ ( q i j / π i j ) ,
where ϕ is a real-valued strictly convex function on [ 0 , ) with ϕ ( 1 ) = ϕ ( 1 ) = 0 , 0 ϕ ( 0 / 0 ) = 0 , 0 ϕ ( y / 0 ) = lim x ϕ ( x ) / x (see e.g., [24]). The KL divergence
I KL ( q , π ) = i , j q i j log ( q i j / π i j ) ,
is derived from (18) for ϕ ( x ) = x log x . Setting ϕ ( x ) = ( 1 x ) 2 , Pearson’s divergence is obtained, while ϕ ( x ) = x λ + 1 x λ ( λ + 1 ) , leads to the power divergence measure of Cressie and Read (CR) [25]
I λ C R ( q , π ) = 1 λ ( λ + 1 ) i = 1 K q i q i π i λ 1 , < λ < , λ 1 , 0 ,
which is a flexible parametric family itself, controlled by the parameter λ . For λ 0 , (20) converges to the I KL ( q , π ) , for λ = 1 it corresponds to Pearson’s divergence and for λ = 1 / 2 to the Hellinger divergence.
The ϕ -divergence-based Q S family of models ( Q S L ( ϕ ) ), introduced by [7], is defined by
π i j = π i j S F 1 ( α i + γ i j ) , i , j = 1 , , I ,
with γ i j = γ j i , where F 1 is the inverse function of F = ϕ . For the KL divergence ϕ ( x ) = x log ( x ) and (21) leads to model (4) while for the CR ϕ ( x ) = x λ + 1 x λ ( λ + 1 ) , the Q S L ( λ ) family of models
π i j = π i j S { 1 + λ ( α i + γ i j ) } 1 / λ , i , j = 1 , , I ,
is derived. In this context, the standard Q S model is denoted by Q S L ( 0 ) as a member of this family. The superscript L in these models notation indicates the fact that they impose symmetric restrictions on LOR.
Analogously, the generalized family of O Q S models, O Q S ( ϕ ) , is defined by
π i j = π i j S F 1 ( α μ i + γ i j ) , i , j = 1 , , I ,
with γ i j = γ j i and μ 1 , , μ I known monotonic equidistant scores (see [8]). O Q S ( ϕ ) reduces to (9) for ϕ ( x ) = x log ( x ) , while, for the CR divergence, O Q S ( λ ) is given by
π i j = π i j S { 1 + λ ( α μ i + γ i j ) } 1 / λ , i , j = 1 , , I .

5. New Families of ϕ -Divergence Generalized QS Models

The Q S -type models of Section 2.2 that are linked to AMs with homogenous row and column scores can be extended to ϕ -divergence-based families through the ϕ -divergence AMs of [14]. A brief presentation of the ϕ -divergence AMs and the underlying concept can be found in [22]. Here, we focus just on the families corresponding to the U model with homogeneous row and column scores. Associated ϕ -divergence-based family of models U h ( ϕ ) is defined as
π i j = π i + π + j F 1 α i + β j + ζ μ i μ j , i , j = 1 , , I ,
where π i + and π + j denote the i-th row and j-th column marginals respectively, i.e., π i + = i = 1 I π i j π + j = j = 1 I π i j . For the KL divergence, U h ( ϕ ) leads to model (11) while for the CR–divergence it takes the form
π i j = π i + π + j 1 λ + 1 + λ ( α 1 + β j + ζ μ i μ j ) 1 / λ , i , j = 1 , , I ,
denoted by U h ( λ ) . Hence the special Q S -type model U h d defined in (13), can be extended to a family of models
π i j = π i + π + j F 1 α i + β j + d i I ( i = j ) + ζ μ i μ j , i , j = 1 , , I ,
denoted by U h d ( ϕ ) . The model expression corresponding to CR divergence is denoted by U h d ( λ ) .
Furthermore, the Q S models for generalized odds ratios, introduced in [15] and presented in Section 3, can be extended to a flexible ϕ -divergence family, based on the ϕ -divergence generalized AMs of [17], which are briefly presented below, adjusted in our set–up.
Analogously to expression (12) for the U h L model, U h ( ϕ ) can alternatively be expressed as
θ i j L ( ϕ ) = ζ ( μ i + 1 μ i ) ( μ j + 1 μ j ) = ζ δ 2 , i , j = 1 , , I 1 ,
where
θ i j L ( ϕ ) = F π i j π i + π + j + F π i + 1 , j + 1 π i + 1 , + π + , j + 1 F π i + 1 , j π i + 1 , + π + j F π i , j + 1 π i + π + , j + 1 ,
for i , j = 1 , , I 1 , are measures of local dependence, scaled through the ϕ -divergence and denoted by LOR ( ϕ ) . For ϕ ( x ) = x log x , LOR ( ϕ ) is the log(LOR), modeled in (12), which in the sequel is denoted as θ i j L ( 0 ) = log ( θ i j L ) , i , j = 1 , , I 1 . For the CR divergence,(29) becomes
θ i j L ( λ ) = 1 λ π i j π i + π + j λ + π i + 1 , j + 1 π i + 1 , + π + , j + 1 λ π i + 1 , j π i + 1 , + π + j λ π i , j + 1 π i + π + , j + 1 λ .
Forcina and Kateri ([17]) provided expressions for ϕ -scaled generalized odds ratios and introduced families of ϕ -divergence AMs for generalized odds ratios, which they studied. Thus, for example, GOR and COR extend to GOR ( ϕ ) and COR ( ϕ ) , given by
θ i j G ( ϕ ) = F π i ̲ , j ̲ π i ̲ + π + j ̲ + F π i ¯ , j ¯ π i ¯ + π + j ¯ F π i ¯ , j ̲ π i ¯ + π + j ̲ F π i ̲ , j ¯ π i ̲ + π + j ¯ ,
and
θ i j C ( ϕ ) = F π i , j ̲ π i + π + j ̲ + F π i + 1 , j ¯ π i + 1 , + π + j ¯ F π i + 1 , j ̲ π i + 1 , + π + j ̲ F π i , j ¯ π i + π + j ¯ ,
for i , j = 1 , , I 1 . The merged probabilities in (31) and (32) are the same as defined in (14) and (15). For the KL divergence, (31) and (32) reduce to (14) and (15), while for the CR divergence, setting F ( x ) = 1 λ x λ , the corresponding expressions GOR ( λ ) and COR ( λ ) are derived. Through these ϕ -scaled generalized odds ratios, generalized Q S models, such as Q S G and Q S C , are extended to ϕ -divergence-based families of models by replacing in the definitions (16) and (17) the GOR and COR by GOR ( ϕ ) and COR ( ϕ ) . Q S -type models for other types of generalized odds ratios, introduced in [15], can be analogously extended to ϕ -divergence-based families.

6. Examples

We first illustrate Q S -type models on one of the most classical datasets of square tables, namely, the women vision data provided in Table 1. Apart from the standard Q S model fitted often in the literature, the author in [5] fitted on this dataset the O Q S model, while the authors in [7] the Q S L ( λ ) and in [8] the O Q S ( λ ) , for λ = 1 . Furthermore, the authors in [15] fitted Q S models for generalized odds ratios other than the LOR.
Our second example, provided in Table 2, cross-classifies male respondents of the 2008 General Social Survey (GSS) in the USA on the basis of their degree of pride with regard to America’s economic vs scientific and tech achievements. We applied on this dataset the same models as on the women-vision data.
In Table 3 we provide the likelihood ratio goodness of fit (GOF) test statistic values for Q S -type models fitted on the LOR for both examples. Table 4 shows the GOF test statistics for the generalized Q S model fitted on the GOR.
For the women-vision example, O Q S models were best. The scale in this case is not of practical importance, since Q S L ( 0 ) and Q S L ( 1 ) , corresponding to the KL and Pearsonian divergence, are of comparable fit (their maximum likelihood estimates (MLEs) differ on the second decimal place). The MLEs of the expected cell frequencies under Q S L ( 1 ) are provided in Table 1. This model, i.e., (24) with λ = 1 and μ i = i , i = 1 , , I , takes the final form (see [8])
π i j = π i j S { 1 + α ( i j ) } , i , j = 1 , , I .
The MLE for α is α ^ = 0.0534 < 0 and hence the probabilities in the lower triangle in Table 1 are estimated to be smaller than those in the upper. Hence the vision is worse for the left eye. Under this model, it holds
π i j π j i = 1 + α ( i j ) 1 + α ( j i ) , i < j .
Thus, the odds of an observation falling in a certain subdiagonal under the main diagonal of the table (instead of the corresponding superdiagonal) are estimated as π ^ i j π ^ j i = 1 0.0534 ( i j ) 1 0.0534 ( j i ) , i < j . Notice that in [8] the corresponding α ^ value is different (=0.119). This is due to rescaling, since there is used a different set of u i scores (the π ^ i j ’s are the same).
The situation is different for the second example, where it is clear that the KL divergence should be used for modeling the LOR (see Table 3). Furthermore we see that for this data set, the U h d model, that imposes a special parsimonious structure on the interaction terms and not on the main effects (as under the O Q S models), is of better fit. However, an impressive fit is provided by the Q S G models (see Table 4). The best fit is for λ = 0 and thus the MLEs of the expected cell frequencies under Q S G ( 0 ) are shown in Table 2. Hence, for this data set the Q S property is significantly stronger supported for the global (than the local) dependencies.
In our examples, we considered specific choices for the parameter λ . Analysis and interpretation of results follows analogously for other choices of the parameter λ .

7. Discussion

In future research, it would be interesting to consider more parsimonious Q S -type models for generalized odds ratios, analogs to U h d for LOR, as for example, the model of uniform GOR with homogeneous (equidistant) row and column scores, i.e., satisfying
log ( θ i j G ) = ζ δ 2 , i , j = 1 , , I 1 ,
that additionally fits the probabilities on the main diagonal cells exactly. AMs that model interactions other than local, though they are naturally defined by the corresponding type of odds ratios, do not provide closed form expressions for the individual cell probabilities. Hence, a definition of the associated Q S -type models by expressions analog to (13) is not possible. For the same reason, the O Q S model cannot be extended to other types of odds ratios by the approach adopted here. Since it imposes a special structure on the main effects, this cannot be captured when defining models in terms of odds ratios; expressions in terms of cell probabilities are required. Recently, the authors in [17] derived expressions for such generalized AMs in terms of suitable associated marginal probabilities. These expressions include parameters for the main effects (see Forms (9) and (10) in [17]). One could generalize U h d and O Q S for other types of odds ratios, adopting the framework of [17].

8. Conclusions

In this work, we revisited the Q S model for square contingency tables with commensurable classification variables and discussed its possible equivalent formulations. Q S is mostly expressed in terms of cell probabilities, while it can alternatively be expressed in terms of local odds ratios Its definition as a departure model from the more parsimonious model of complete symmetry provides additional interpretation features. Furthermore, we considered the O Q S model, a more parsimonious Q S -type model, applicable if the classification scale is ordinal, which imposes a special structure on the main effects of the model. On the other hand, further Q S -type models can be derived by considering a special structure for the interaction terms. This is possible through AMs with homogeneous row and column scores. In particular, by adding parameters to homogeneous AMs that ensure the exact fit on the diagonal entries of the contingency table, models are derived that model the off-diagonal cells and have symmetric interaction terms. Thus, they are of the Q S -type, but more parsimonious than the standard Q S model. All these models are related to LOR and model local dependencies of the table. Next we present how these models can be defined for other types of generalized odds ratios, reviewing the work of [15].
In a statistical information-theoretic setup, Q S -type models and AMs satisfy properties of closeness to a specific reference model, when their divergence from the reference model is measured in terms of KL divergence. The reference model is that of symmetry (for Q S and O Q S models) or independence (for AMs). Replacing KL divergence with ϕ -divergence, generalized families of AMs, Q S and O Q S models for LOR were considered by [7,8,14], respectively. The Q S model was linked to GOR in [16], while [15] introduced and implemented Q S models for GOR and other types of generalized odds ratios, without, however, considering the link to divergence measures and associated properties. The possible extension of these models in terms of ϕ -divergence was a topic for further research in [15]. Here, we extended these models to ϕ -scaled generalized odds ratios and linked them to corresponding AMs on the basis of the results and models discussed in [17]. We demonstrated the flexibility in modeling the classes of models discussed here by implementing and discussing some of these models on two representative examples.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data of Table 2 can be found in https://gssdataexplorer.norc.org/ (accessed on 10 October 2021).

Acknowledgments

The author thanks the reviewers for their useful comments on an earlier version of the manuscript.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Caussinus, H. Contribution a l’analyse statistique des tableaux de correlation. In Annales de la Faculté des Sciences de Toulouse; Université Paul Sabatier: Toulouse, France, 1965; Volume 29, pp. 77–183. [Google Scholar]
  2. Bishop, Y.M.; Fienberg, S.E.; Holland, P.W. Discrete Multivariate Analysis: Theory and Practice; MIT Press: Cambridge, MA, USA, 1975. [Google Scholar]
  3. Goodman, L.A. Simple Models for the Analysis of Association in Cross-Classifications having Ordered Categories. J. Am. Stat. Assoc. 1979, 74, 537–552. [Google Scholar] [CrossRef]
  4. McCullagh, P. Some applications of quasisymmetry. Biometrika 1982, 69, 303–308. [Google Scholar] [CrossRef]
  5. Agresti, A. A simple diagonals–parameters symmetry and quasi-symmetry model. Stat. Probab. Lett. 1983, 1, 313–316. [Google Scholar] [CrossRef]
  6. Goodman, L.A. The analysis of cross-classified data having ordered and/or unordered categories: Association models, correlation models, and asymmetry models for contingency tables with or without missing entries. Ann. Stat. 1985, 13, 10–69. [Google Scholar] [CrossRef]
  7. Kateri, M.; Papaioannou, T. Asymmetry models for contingency tables. J. Am. Stat. Assoc. 1997, 92, 1124–1131. [Google Scholar] [CrossRef]
  8. Kateri, M.; Agresti, A. A class of ordinal quasi symmetry models for square contingency tables. Stat. Probab. Lett. 2007, 77, 598–603. [Google Scholar] [CrossRef]
  9. Saigusa, Y.; Tahata, K.; Tomizawa, S. Orthogonal decomposition of symmetry model using the ordinal quasi-symmetry model based on f-divergence for square contingency tables. Stat. Probab. Lett. 2015, 101, 33–37. [Google Scholar] [CrossRef] [Green Version]
  10. Tahata, K. On asymmetry models and decompositions of symmetry for square contingency tables with ordered categories. In Proceedings of the 32nd International Workshop on Statistical Modelling, Groningen, The Netherlands, 3–7 July 2017; Volume 2, pp. 231–234. [Google Scholar]
  11. Tahata, K. Separation of symmetry for square tables with ordinal categorical data. Jpn. J. Stat. Data Sci. 2020, 3, 469–484. [Google Scholar] [CrossRef]
  12. Fujisaw, K.; Tahata, K. Asymmetry model based on f-divergence and orthogonal decomposition of symmetry for square contingency tables with ordinal categories. SUT J. Math. 2020, 56, 39–53. [Google Scholar]
  13. Tahata, K.; Tomizawa, S. Symmetry and asymmetry models and decompositions of models for contingency tables. SUT J. Math. 2014, 50, 131–165. [Google Scholar]
  14. Kateri, M.; Papaioannou, T. f-divergence Association Models. Int. J. Math. Stat. Sci. 1994, 3, 179–203. [Google Scholar]
  15. Kateri, M.; Gottard, A.; Tarantola, C. Generalized quasi symmetry models for ordinal contingency tables. Aust. N. Z. J. Stat. 2017, 59, 239–253. [Google Scholar] [CrossRef]
  16. Bartolucci, F.; Forcina, A. Extended RC association models allowing for order restrictions and marginal modeling. J. Am. Stat. Assoc. 2002, 97, 1192–1199. [Google Scholar] [CrossRef]
  17. Forcina, A.; Kateri, M. A new general class of RC association models: Estimation and main properties. J. Multivar. Anal. 2021, 184, 104741. [Google Scholar] [CrossRef]
  18. Becker, M.P. Quasisymmetric models for the analysis of square contingency tables. J. R. Stat. Soc. Ser. (Methodol.) 1990, 52, 369–378. [Google Scholar] [CrossRef]
  19. Goodman, L.A. Multiplicative models for square contingency tables with ordered categories. Biometrika 1979, 66, 413–418. [Google Scholar] [CrossRef]
  20. McCullagh, P. A class of parametric models for the analysis of square contingency tables with ordered categories. Biometrika 1978, 65, 413–418. [Google Scholar] [CrossRef]
  21. Kateri, M. Contingency Table Analysis: Methods and Implementation Using R; Birkhäuser (Springer): New York, NY, USA, 2014. [Google Scholar]
  22. Kateri, M. ϕ-divergence in contingency table analysis. Entropy 2018, 20, 324. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Douglas, R.; Fienberg, S.E.; Lee, M.L.T.; Sampson, A.R.; Whitaker, L.R. Positive dependence concepts for ordinal contingency tables. In Topics in Statistical Dependence; Lecture Notes–Monograph Series; Block, H.W., Sampson, A.R., Savits, T.H., Eds.; Institute of Mathematical Statistics: Ann Arbor, MI, USA, 1990; pp. 189–202. [Google Scholar]
  24. Pardo, L. Statistical Inference Based on Divergence Measures; Chapman & Hall: New York, NY, USA, 2006. [Google Scholar]
  25. Cressie, N.; Read, T. Multinomial Goodness-of-Fit Tests. J. R. Stat. Soc. Ser. B 1984, 46, 440–464. [Google Scholar] [CrossRef]
Table 1. Cross-classification of 7477 women by unaided distance vision of right and left eyes. In parentheses are the MLEs of the expected frequencies under OQS ( 1 ) model.
Table 1. Cross-classification of 7477 women by unaided distance vision of right and left eyes. In parentheses are the MLEs of the expected frequencies under OQS ( 1 ) model.
Left Eye Grade
Right Eye GradeBestSecondThirdWorstTotal
best1520266 (263.35)124 (133.37)66 (59.17)1976
second234 (236.65)1512432 (418.20)78 (88.54)2256
third117 (107.63)362 (375.80)1772205 (202.25)2456
worst36 (42.83)82 (71.46)179 (181.74)492789
total1907222225078417477
Table 2. Cross-classification of male respondents of GSS 2008 survey by degree of proud (VP: very proud, SWP: somewhat proud, NVP: not very proud; NP: not proud at all) with regard to America’s economic achievements (rows) vs scientific and tech achievements (columns). In parentheses are the MLEs of the expected frequencies under QS G ( 0 ) model.
Table 2. Cross-classification of male respondents of GSS 2008 survey by degree of proud (VP: very proud, SWP: somewhat proud, NVP: not very proud; NP: not proud at all) with regard to America’s economic achievements (rows) vs scientific and tech achievements (columns). In parentheses are the MLEs of the expected frequencies under QS G ( 0 ) model.
Science and Tech
EconomicVPSWPNVPNPTotal
VP369 (368.81)59 (58.73)6 (6.84)1 (0.97)435
SWP226 (226.63)238 (237.69)10 (9.74)3 (2.85)477
NVP60 (59.16)67 (67.65)14 (13.78)2 (2.09)143
NP7 (7.063)16 (16.18)3 (2.83)2 (2.00)28
total6623803381083
Table 3. Goodness-of-fit test statistics (with corresponding P values in parentheses) for the Q S L ( λ ) and O Q S ( λ ) for λ { 0 , 1 } along with the U h d model fitted on data of Table 1 and Table 2.
Table 3. Goodness-of-fit test statistics (with corresponding P values in parentheses) for the Q S L ( λ ) and O Q S ( λ ) for λ { 0 , 1 } along with the U h d model fitted on data of Table 1 and Table 2.
Model df Table 1Table 2
Q S L ( 0 ) 37.2708 (0.0638)3.4181 (0.3315)
Q S L ( 1 ) 37.2620 (0.0640)20.1060 (0.0002)
O Q S ( 0 ) 57.2804 (0.2006)8.8867 (0.1137)
O Q S ( 1 ) 57.2708 (0.2013)47.7122 (< 10 5 )
U h d 461.3082 (< 10 5 )3.9002 (0.4197)
Table 4. Goodness-of-fit test statistics for the Q S G ( λ ) models for the GOR, fitted on the data of Table 1 and Table 2 for λ { 1 / 2 , 0 , 1 / 3 , 2 / 3 , 1 } . All models have d f = 3 .
Table 4. Goodness-of-fit test statistics for the Q S G ( λ ) models for the GOR, fitted on the data of Table 1 and Table 2 for λ { 1 / 2 , 0 , 1 / 3 , 2 / 3 , 1 } . All models have d f = 3 .
ModelTable 1Table 2
Q S G ( 1 / 2 ) 7.5474 (0.0564)0.4239 (0.9353)
Q S G ( 0 ) 7.6242 (0.0544)0.1652 (0.9830)
Q S G ( 1 / 3 ) 7.8872 (0.0484)0.2105 (0.9759)
Q S G ( 2 / 3 ) 8.4732 (0.0372)0.4437 (0.9311)
Q S G ( 1 ) 9.2519 (0.0261)0.8973 (0.8261)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kateri, M. Families of Generalized Quasisymmetry Models: A ϕ-Divergence Approach. Symmetry 2021, 13, 2297. https://0-doi-org.brum.beds.ac.uk/10.3390/sym13122297

AMA Style

Kateri M. Families of Generalized Quasisymmetry Models: A ϕ-Divergence Approach. Symmetry. 2021; 13(12):2297. https://0-doi-org.brum.beds.ac.uk/10.3390/sym13122297

Chicago/Turabian Style

Kateri, Maria. 2021. "Families of Generalized Quasisymmetry Models: A ϕ-Divergence Approach" Symmetry 13, no. 12: 2297. https://0-doi-org.brum.beds.ac.uk/10.3390/sym13122297

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop