INDEX
Explanations
adjectives or adverbs that express value judgments
adjectives and descriptors that convey strong qualifiers or evaluations
Negative Logits
aminer        -0.62
udeb          -0.59
ioxide        -0.59
endment       -0.56
Revolution    -0.56
iphate        -0.56
ilet          -0.54
umo           -0.53
RPG           -0.53
ourke         -0.51
Positive Logits
etheless      0.95
enough        0.67
entimes       0.65
ortunately    0.65
nesses        0.62
amounts       0.59
ly            0.59
ones          0.59
apologies     0.58
importantly   0.57
Activations Density 0.620%