INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.06
2:0.07
3:0.08
4:0.09
5:0.09
6:0.07
7:0.08
8:0.08
9:0.08
10:0.09
11:0.09
Negative Logits
Pa
-1.57
sung
-1.55
_-_
-1.50
perpetrated
-1.48
fond
-1.46
Mane
-1.44
Against
-1.44
Bj
-1.42
jer
-1.37
Wisconsin
-1.37
POSITIVE LOGITS
artments
1.93
escape
1.78
sheets
1.74
sheet
1.57
tained
1.55
cius
1.54
outer
1.52
azine
1.51
elist
1.50
guide
1.50
Activations Density 0.000%
No Known Activations
This feature has no known activations.