INDEX
Explanations
instances of specific pronouns and articles
Followed by nouns in informal contexts
the instructions, game, problem, post
New Auto-Interp
Negative Logits
="#"><
-0.78
således
-0.71
asimismo
-0.70
précie
-0.70
noodzake
-0.67
sağlar
-0.67
さまざまな
-0.66
ciasc
-0.66
pertanto
-0.65
largely
-0.65
POSITIVE LOGITS
guy
1.08
damn
0.94
stupid
0.92
thing
0.91
whole
0.90
WHOLE
0.88
darn
0.81
pics
0.80
dude
0.80
whole
0.80
Activations Density 0.377%