INDEX
Explanations
information related to specific entities or topics
subjective pronouns and verb phrases indicating actions or states
New Auto-Interp
Negative Logits
emption
-0.75
cock
-0.72
itely
-0.65
endment
-0.64
icka
-0.63
gered
-0.63
oller
-0.62
76561
-0.61
messing
-0.60
Compatibility
-0.59
POSITIVE LOGITS
mammoth
0.68
paved
0.63
bes
0.62
cli
0.62
Palestin
0.59
nevertheless
0.58
nonetheless
0.58
doomed
0.57
fraught
0.57
ls
0.57
Activations Density 0.347%