INDEX
Explanations
the word "because" and its variants, indicating a focus on reasoning or justifications
New Auto-Interp
Negative Logits
Lockheed
-0.78
aquin
-0.77
Viited
-0.76
estimés
-0.74
nadzieję
-0.73
Nuorodos
-0.73
ſeveral
-0.71
archiviato
-0.70
ſta
-0.69
Ashanti
-0.69
POSITIVE LOGITS
Because
1.37
Because
1.31
ECAUSE
1.25
BECAUSE
1.21
because
1.18
because
1.08
Cuz
1.06
Porque
1.04
Cuz
0.99
porque
0.94
Activations Density 0.080%