INDEX
Explanations
adjectives followed by a hyphen and a number indicating degree
phrases related to negative conditions or criticisms
Negative Logits
lisher              -0.70
Authors             -0.65
antioxid            -0.65
ħĭ                  -0.64
£ı                  -0.64
guiActiveUnfocused  -0.64
illon               -0.63
scissors            -0.62
Walls               -0.61
"$:/                -0.60
Positive Logits
gotten    0.96
iquid     0.77
usive     0.76
untled    0.71
imm       0.67
utton     0.67
inen      0.66
ventures  0.65
nered     0.63
earth     0.63
Activations Density 0.044%