Explanations
the word "for" and related phrases indicating purpose or reason
Negative Logits
iada              -0.55
ışı               -0.54
cortesía          -0.52
withIdentifier    -0.50
انتهای            -0.49
voorbeeld         -0.49
ardin             -0.49
Hotspur           -0.49
casco             -0.48
thang             -0.48
Positive Logits
there             1.05
there             0.81
they              0.80
Sebab             0.74
Ведь              0.72
Ведь              0.68
we                0.67
THERE             0.65
although          0.65
it                0.65
Activations Density 0.185%