INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.07
1:0.06
2:0.08
3:0.09
4:0.09
5:0.08
6:0.08
7:0.07
8:0.08
9:0.07
10:0.08
11:0.09
Negative Logits
fits      -1.98
abet      -1.78
ims       -1.74
abus      -1.64
videos    -1.63
serv      -1.61
riages    -1.58
compet    -1.53
comings   -1.52
amina     -1.50
Positive Logits
Grant      1.85
Grant      1.61
Minute     1.49
Woodward   1.45
WER        1.45
Harold     1.44
KT         1.41
Cliff      1.39
Grizz      1.36
Lloyd      1.35
Activations Density 0.000%
No Known Activations
This feature has no known activations.