INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.09
1:0.10
2:0.06
3:0.07
4:0.08
5:0.07
6:0.09
7:0.07
8:0.08
9:0.08
10:0.08
11:0.07
Negative Logits
Specific: -2.75
Logged: -2.62
Beam: -2.49
ient: -2.48
TPPStreamerBot: -2.47
Cert: -2.43
pots: -2.43
Application: -2.40
ーク: -2.36
Qual: -2.35
Positive Logits
Stras: 3.13
Duc: 3.11
Judaism: 3.07
Yose: 2.94
Auschwitz: 2.86
Dj: 2.85
Rabbi: 2.81
Jordanian: 2.79
Mog: 2.75
Africans: 2.73
Activations Density: 0.000%
This feature has no known activations.