INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.08
2:0.08
3:0.08
4:0.08
5:0.07
6:0.09
7:0.09
8:0.07
9:0.08
10:0.08
11:0.08
Negative Logits
onz
-1.93
uxe
-1.74
ote
-1.70
iton
-1.69
itures
-1.68
itor
-1.67
thereof
-1.66
Junior
-1.65
uce
-1.65
atories
-1.63
POSITIVE LOGITS
behavi
2.74
predic
2.13
millenn
2.12
enthusi
2.05
advoc
2.03
BIL
1.98
Notting
1.98
Niet
1.96
�
1.95
proble
1.93
Activations Density 0.000%
No Known Activations
This feature has no known activations.