INDEX
Explanations
instances of the word "and."
New Auto-Interp
Negative Logits
amsung
-0.68
bint
-0.68
elfare
-0.67
okin
-0.67
eload
-0.66
endas
-0.65
ITECT
-0.64
المعيارى
-0.64
<?
-0.63
팎
-0.63
POSITIVE LOGITS
therefore
0.91
although
0.86
hence
0.75
поэтому
0.71
consequently
0.69
thus
0.67
although
0.64
this
0.62
därför
0.62
therefore
0.61
Activations Density 0.567%