INDEX
Explanations
terms related to individuality and personal characteristics
New Auto-Interp
Negative Logits
cleros
-0.74
ç
-0.65
Hélène
-0.62
Koning
-0.62
CLO
-0.61
Fon
-0.60
Hailey
-0.59
CLO
-0.59
resh
-0.59
karş
-0.59
POSITIVE LOGITS
individual
2.17
Individual
2.17
INDIVIDUAL
2.15
Individual
2.13
individual
2.04
ividual
1.89
Individuals
1.85
Individuals
1.80
individuals
1.79
IVIDUAL
1.76
Activations Density 0.066%