INDEX
Explanations
references to knowledge and recognition about public figures or subjects
New Auto-Interp
Negative Logits
pdata
-0.16
anity
-0.16
ç´į
-0.15
/Private
-0.14
ancode
-0.14
rý
-0.14
ENA
-0.14
prospect
-0.14
änge
-0.14
apat
-0.14
POSITIVE LOGITS
correct
0.19
ignorance
0.18
knowledge
0.18
know
0.16
facts
0.16
knew
0.16
names
0.16
fact
0.16
name
0.16
basic
0.15
Activations Density 0.227%