INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.06
2:0.09
3:0.07
4:0.08
5:0.08
6:0.08
7:0.08
8:0.07
9:0.09
10:0.09
11:0.10
Negative Logits
sporting
-1.73
smack
-1.65
Exhibit
-1.60
congr
-1.54
Comedy
-1.51
Kanye
-1.51
reel
-1.48
Wim
-1.47
unveiling
-1.46
Lect
-1.46
POSITIVE LOGITS
ntil
1.82
isance
1.69
iard
1.69
quit
1.61
om
1.58
ignt
1.58
pox
1.57
irable
1.56
Residents
1.56
ggle
1.55
Activations Density 0.000%
No Known Activations
This feature has no known activations.