INDEX
Explanations
phrases indicating conditional or consequential relationships
New Auto-Interp
Negative Logits
and
-0.44
honom
-0.43
الحره
-0.42
ślę
-0.42
/-
-0.40
őket
-0.38
Einzelnachweise
-0.38
Toujours
-0.37
lím
-0.37
always
-0.36
POSITIVE LOGITS
although
1.07
there
0.92
while
0.88
indeed
0.81
they
0.78
despite
0.78
although
0.77
unless
0.76
fact
0.75
whilst
0.75
Activations Density 0.418%