INDEX
Explanations
names of a specific person – Jay
New Auto-Interp
Negative Logits
milo
-0.81
iture
-0.79
ITIES
-0.78
ENTION
-0.75
htaking
-0.73
ACTED
-0.70
ancial
-0.69
idad
-0.68
IRE
-0.66
iments
-0.65
POSITIVE LOGITS
hawks
1.28
hawk
1.15
walking
1.01
haw
0.96
lon
0.93
den
0.92
bird
0.89
Jay
0.86
jay
0.86
pee
0.85
Activations Density 0.011%