INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.05
2:0.09
3:0.09
4:0.08
5:0.07
6:0.07
7:0.07
8:0.09
9:0.08
10:0.09
11:0.08
Negative Logits
acknowled
-1.75
adject
-1.71
verbs
-1.66
acknow
-1.60
denotes
-1.58
encrypt
-1.57
Chomsky
-1.56
################################
-1.56
sighting
-1.55
################
-1.54
POSITIVE LOGITS
drawn
1.89
rup
1.78
makers
1.78
rent
1.69
paced
1.68
visor
1.68
��
1.61
cgi
1.60
blight
1.55
plete
1.54
Activations Density 0.000%
No Known Activations
This feature has no known activations.