INDEX
Explanations
words related to controversial or negative situations
Negative Logits
backer      -0.72
ivities     -0.71
meal        -0.67
Prosper     -0.67
nomine      -0.66
bilt        -0.65
virtue      -0.65
Ceres       -0.64
Perse       -0.63
Galactic    -0.62
Positive Logits
bing        1.36
yrinth      1.21
oard        1.18
ecause      1.17
elled       1.16
bles        1.15
bers        1.14
bed         1.14
ruary       1.13
dullah      1.11
Activations Density: 1.775%