INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.05
2:0.08
3:0.08
4:0.08
5:0.08
6:0.08
7:0.09
8:0.08
9:0.08
10:0.08
11:0.09
Negative Logits
rife
-2.04
lawy
-2.03
akin
-2.00
riddled
-1.92
begging
-1.86
Azerb
-1.82
defiant
-1.80
upholding
-1.78
policeman
-1.72
fraught
-1.71
POSITIVE LOGITS
isk
1.94
obook
1.78
Meg
1.70
Jelly
1.66
otte
1.64
omer
1.63
amba
1.61
ftime
1.60
quart
1.58
Rx
1.57
Activations Density 0.000%
No Known Activations
This feature has no known activations.