INDEX
Explanations
conjunctions and cause-and-effect relationships in the text
New Auto-Interp
Negative Logits
UnusedPrivate
-0.75
المعيارى
-0.73
дописавши
-0.71
GenerationType
-0.69
WriteTagHelper
-0.69
पया
-0.68
IsMutable
-0.66
蚪
-0.65
UserScript
-0.64
#+#
-0.64
POSITIVE LOGITS
although
0.56
since
0.50
they
0.50
gdyż
0.47
Although
0.47
Since
0.47
although
0.47
since
0.45
obwohl
0.45
chociaż
0.43
Activations Density 0.375%