INDEX
Explanations
proper nouns or names of people/places along with the verb "knows" indicating knowledge or awareness
statements of knowledge or awareness
Negative Logits
phrine         -0.86
oples          -0.84
interstitial   -0.84
otion          -0.83
aredevil       -0.77
isco           -0.75
ItemTracker    -0.74
ermanent       -0.73
issions        -0.70
nesota         -0.70
Positive Logits
ledged         1.19
ledge          1.08
lege           0.90
LED            0.86
how            0.82
ariat          0.78
ingly          0.75
nothing        0.73
beforehand     0.73
nothing        0.72
Activations Density 0.068%