Explanations
phrases related to personal relationships and conflicts
Negative Logits
Token         Logit
ratulations   -0.75
merce         -0.75
sylv          -0.74
QUI           -0.73
clair         -0.73
TEXT          -0.72
SPONSORED     -0.72
worker        -0.71
Enlarge       -0.70
retty         -0.68
Positive Logits
Token         Logit
whence        1.02
afar          1.00
thence        0.84
obscurity     0.78
orbit         0.70
captivity     0.66
scratch       0.66
Brune         0.65
existence     0.65
premises      0.65
Activations Density: 0.549%