INDEX
Explanations
phrases that indicate negation or uncertainty in statements
New Auto-Interp
Negative Logits
almost
-0.68
Almost
-0.65
Almost
-0.64
almost
-0.61
hampir
-0.58
quase
-0.57
ThroughAttribute
-0.56
piuttosto
-0.54
scarcely
-0.54
հղումներ
-0.54
POSITIVE LOGITS
necessarily
1.72
necessariamente
1.33
necesariamente
1.32
necessarily
1.30
nécessairement
1.12
unbedingt
0.96
Necess
0.91
always
0.90
forcément
0.90
nødvendig
0.89
Activations Density 1.170%