Explanations
expressions of frustration or disapproval regarding actions or situations
Negative Logits
neither           -0.69
whichever         -0.69
sourcing          -0.65
Flavoring         -0.64
testament         -0.64
ONLY              -0.63
NEVER             -0.62
irrespective      -0.61
notwithstanding   -0.59
nowhere           -0.59

Positive Logits
orthy             0.77
kered             0.66
mir               0.64
kinds             0.63
posure            0.62
ijah              0.61
bery              0.61
mosp              0.60
umer              0.58
ogg               0.57

Activations Density 1.354%