INDEX
Explanations
precise verbs related to observation or understanding
words related to perception and understanding
New Auto-Interp
Negative Logits
Interstitial
-0.70
OH
-0.70
rush
-0.69
USE
-0.66
rera
-0.64
hibit
-0.63
atro
-0.62
raid
-0.61
´
-0.61
BRE
-0.61
POSITIVE LOGITS
discern
1.02
ible
0.88
hest
0.86
glim
0.83
hent
0.82
iban
0.82
wered
0.82
iour
0.79
ibly
0.79
smanship
0.78
Activations Density 0.015%