Explanations
expressions of causation or reasoning
Negative Logits
Token            Logit
MethodManager    -0.57
PreferredItem    -0.56
للمعارف    -0.50
estekak          -0.50
UserScript       -0.48
IGraphics        -0.48
haikusbot        -0.48
पया    -0.48
richTextPanel    -0.47
unſ              -0.46
Positive Logits
Token      Logit
because    0.77
gdyż       0.74
เพราะ    0.69
because    0.69
Porque     0.68
ибо        0.66
denn       0.66
porque     0.65
因为    0.64
چون    0.64
Activations Density 0.514%