INDEX
Explanations
personal pronouns followed by verbs expressing an action or a state
common phrases and questions directed at the audience or reader
Negative Logits
onto          -0.72
arget         -0.67
ologies       -0.67
Draw          -0.67
arta          -0.67
encies        -0.67
requires      -0.66
Adds          -0.65
etc           -0.65
Neither       -0.65
Positive Logits
originally    0.89
last          0.86
conceived     0.81
instrumental  0.79
gracious      0.78
サーティワン  0.77
yesterday     0.74
KGB           0.70
revelation    0.70
initially     0.68
Activations Density 0.843%