INDEX
Explanations
references to values and principles in various contexts
New Auto-Interp
Negative Logits
й
-0.79
DeWitt
-0.75
Bess
-0.75
Kongo
-0.73
Margot
-0.72
Mitar
-0.71
Antiquities
-0.65
giy
-0.65
Missy
-0.64
Pfund
-0.64
POSITIVE LOGITS
values
1.78
values
1.63
Values
1.58
VALUES
1.56
VALUES
1.47
Values
1.46
1.41
Werte
1.10
Valores
1.10
valores
1.05
Activations Density 0.133%