INDEX
Explanations
phrases related to success and positive outcomes
New Auto-Interp
Negative Logits
解放
-0.46
inger
-0.45
zean
-0.45
firestore
-0.42
ingeki
-0.41
المل
-0.41
despre
-0.40
老人
-0.40
ying
-0.39
Herren
-0.39
POSITIVE LOGITS
success
1.09
Success
1.02
SUCCESS
1.01
successful
1.00
success
0.96
successful
0.96
sucesso
0.95
unsuccessful
0.95
צלחה
0.95
éxito
0.94
Activations Density 0.209%