INDEX
Explanations
affirmative or emphatic statements
Negative Logits
Token           Logit
المشاركات       -0.48
deepened        -0.48
Organisateur    -0.42
betweenstory    -0.42
flexGrow        -0.40
izedBox         -0.39
Biografi        -0.39
balleur         -0.38
Sheehan         -0.38
facilitated     -0.38
Positive Logits
Token           Logit
indeed          0.70
regard          0.61
regarded        0.61
indeed          0.58
Indeed          0.56
Indeed          0.54
regards         0.53
bowiem          0.50
regard          0.49
extAlignment    0.48
Activations Density: 0.091%