INDEX
Explanations
references to individuals or groups of people
New Auto-Interp
Negative Logits
iscri
-0.49
Utrecht
-0.47
pregi
-0.47
הכ
-0.42
ihnachts
-0.42
交
-0.41
Dry
-0.40
inizio
-0.39
Maple
-0.39
old
-0.38
POSITIVE LOGITS
anyone
1.09
everyone
1.08
someone
1.02
anyone
0.96
everyone
0.95
ANYONE
0.95
Someone
0.94
Anyone
0.91
EVERYONE
0.91
someone
0.91
Activations Density 0.124%