INDEX
Explanations
phrases containing the word "though."
phrases indicating conditional or hypothetical situations
Negative Logits
Token        Logit
Winged       -0.66
AZ           -0.63
Bless        -0.63
Horses       -0.62
vant         -0.62
backer       -0.62
Offline      -0.61
Highlights   -0.61
Bas          -0.61
03           -0.61
Positive Logits
Token        Logit
acho         0.69
ovie         0.68
"$:/         0.67
causation    0.66
AMI          0.65
lihood       0.64
oche         0.64
icago        0.62
antha        0.62
tha          0.61
Activations Density 0.006%