INDEX
Explanations
references to individuals or people in various contexts
who followed by verb
New Auto-Interp
Negative Logits
<bos>
-0.41
,
-0.41
isEdit
-0.39
Sanford
-0.38
false
-0.38
Seitz
-0.37
logitech
-0.36
Gass
-0.35
'./../
-0.34
urnia
-0.33
POSITIVE LOGITS
who
1.21
którzy
1.09
kteří
0.97
who
0.94
Who
0.92
Who
0.91
whom
0.91
ktorí
0.87
który
0.83
الذين
0.81
Activations Density 0.049%