INDEX
Explanations
vocabulary related to last resort decisions and options
New Auto-Interp
Negative Logits
Configurer
-0.15
herits
-0.14
жи
-0.14
argent
-0.14
žÃŃ
-0.14
ander
-0.14
sted
-0.14
ger
-0.13
lucent
-0.13
Ậ
-0.13
POSITIVE LOGITS
resort
0.45
resorts
0.36
Resort
0.35
recourse
0.34
drastic
0.31
desperation
0.31
desperate
0.30
extreme
0.30
option
0.29
option
0.27
Activations Density 0.226%