INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
bass
-0.67
Balt
-0.65
inished
-0.64
ãĥ£
-0.63
Nor
-0.63
Pur
-0.62
pite
-0.61
bows
-0.60
parts
-0.58
ters
-0.57
POSITIVE LOGITS
Allaah
0.81
Mandela
0.81
Rowling
0.76
croft
0.75
Ivanka
0.75
Isis
0.73
Canaver
0.73
Ney
0.71
Manziel
0.71
Christensen
0.68
Activations Density 0.000%
No Known Activations
This feature has no known activations.