INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.08
2:0.08
3:0.08
4:0.06
5:0.08
6:0.09
7:0.08
8:0.08
9:0.08
10:0.11
11:0.04
Negative Logits
️
-1.71
Pratt
-1.48
Blumenthal
-1.28
Clement
-1.26
Milo
-1.25
Innocent
-1.24
Pablo
-1.21
Gore
-1.20
Fantastic
-1.16
Caesar
-1.14
POSITIVE LOGITS
iversal
1.46
relegation
1.41
advant
1.37
iffe
1.37
rul
1.36
ipedia
1.33
iciary
1.32
erenn
1.32
aceae
1.31
foundland
1.30
Activations Density 0.000%
No Known Activations
This feature has no known activations.