INDEX
Explanations
mentions of the word "awesome"
expressions of awe or amazement
New Auto-Interp
Negative Logits
xual
-0.76
Townsend
-0.71
iod
-0.71
Feld
-0.68
cision
-0.68
ogue
-0.67
Kaepernick
-0.65
masturbation
-0.63
flares
-0.63
Luxem
-0.63
POSITIVE LOGITS
akening
1.30
akens
1.21
kward
1.17
ashington
1.09
esome
1.02
aii
0.98
yers
0.93
aw
0.92
dry
0.91
AW
0.91
Activations Density 0.012%