INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.07
1:0.06
2:0.07
3:0.09
4:0.10
5:0.07
6:0.07
7:0.10
8:0.09
9:0.06
10:0.07
11:0.09
Negative Logits
veh          -1.87
��           -1.84
ldom         -1.81
charact      -1.78
millenn      -1.73
commer       -1.68
Palest       -1.66
Yug          -1.65
exposition   -1.65
fuzz         -1.60
Positive Logits
appropriately   1.99
iologist        1.97
inet            1.92
ibr             1.88
doctor          1.87
manager         1.85
friends         1.83
ify             1.82
imeo            1.81
Episode         1.75
Activations Density: 0.000%
No Known Activations
This feature has no known activations.