INDEX
Explanations
proper nouns or names
names and identifiers related to people, places, or organizations
Negative Logits
Bord        -0.78
Worm        -0.77
Hu          -0.77
Henderson   -0.76
HAL         -0.76
Bai         -0.75
Letter      -0.74
Luk         -0.73
Maher       -0.73
HER         -0.72
Positive Logits
ic          1.45
ici         1.27
nic         1.23
icer        1.22
IC          1.20
Nic         1.19
icc         1.17
ic          1.15
ics         1.13
ac          1.13
Activations Density: 0.297%