INDEX
Explanations
references to social contexts and collective experiences
New Auto-Interp
Negative Logits
Ra
-0.65
Peshawar
-0.64
Hoover
-0.61
்கள்
-0.61
Tash
-0.61
Holt
-0.61
TLR
-0.60
Hart
-0.60
R
-0.60
L
-0.57
POSITIVE LOGITS
AMONG
1.86
Amongst
1.79
Among
1.67
among
1.67
among
1.65
amongst
1.61
Among
1.50
parmi
1.34
Среди
1.24
среди
1.23
Activations Density 0.049%