INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.05
2:0.09
3:0.07
4:0.08
5:0.07
6:0.08
7:0.07
8:0.09
9:0.07
10:0.10
11:0.09
Negative Logits
д
-2.02
ebook
-1.88
Sov
-1.72
м
-1.67
1976
-1.65
Lans
-1.62
MJ
-1.62
Hels
-1.61
nep
-1.61
[/
-1.56
POSITIVE LOGITS
ashington
2.09
measuring
1.77
debating
1.77
champions
1.72
overlooking
1.63
defining
1.58
democracies
1.58
ideon
1.57
empir
1.57
graded
1.56
Activations Density 0.000%
No Known Activations
This feature has no known activations.