INDEX
Explanations
mentions of someone being foolish or acting in a silly manner
references to being foolish or deceived
New Auto-Interp
Negative Logits
Parenthood
-0.69
ials
-0.67
orney
-0.67
foreseen
-0.62
lined
-0.61
accompan
-0.61
apers
-0.60
ŃĶ
-0.60
apa
-0.59
Delivery
-0.59
POSITIVE LOGITS
hard
0.86
sonian
0.85
ery
0.85
proof
0.84
ingly
0.77
fooled
0.76
pas
0.76
ibility
0.74
naive
0.74
hemer
0.74
Activations Density 0.053%