INDEX
Explanations
references to personal relationships and emotional connections
Pronouns followed by auxiliary verbs
pronouns followed by verbs
New Auto-Interp
Negative Logits
Giving
-0.54
giving
-0.48
ervan
-0.47
Giving
-0.47
있어
-0.46
ligiloj
-0.45
glLoadIdentity
-0.45
Conduct
-0.45
kijken
-0.45
agir
-0.45
POSITIVE LOGITS
sorely
0.68
hopefully
0.67
zuvor
0.67
thankfully
0.66
extAlignment
0.63
+#+#
0.63
luckily
0.63
"]));
0.62
RegisterType
0.61
later
0.61
Activations Density 0.289%