Explanations
proper nouns and entities such as names, locations, and titles
Negative Logits
VIDEOS         -0.89
views          -0.76
inar           -0.71
sticks         -0.69
Characters     -0.67
ophobia        -0.66
aris           -0.66
malink         -0.65
olars          -0.65
estyles        -0.63
Positive Logits
notorious      1.01
former         1.00
famed          0.97
grandson       0.94
famous         0.90
nephew         0.89
infamous       0.86
deceased       0.85
charismatic    0.83
granddaughter  0.83
Activations Density 0.220%
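
For readers unfamiliar with the density figure, the following is a minimal sketch of one common way such a number is computed: the fraction of sampled token positions on which the feature's activation is above zero. The page does not state its exact definition, so the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and used for illustration only.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all; the dashboard's own definition may differ.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 token positions, 2 of which fire.
sample = [0.0] * 998 + [1.3, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.200%"
```

Under this reading, a density of 0.220% would mean the feature is active on roughly 2 out of every 1000 sampled tokens.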