INDEX
Explanations
conjunctions and transition words
Negative Logits
Token         Logit
entanto       -0.72
Anyways       -0.63
Porém         -0.60
Maintenant    -0.60
Però          -0.59
them          -0.56
Shakspeare    -0.54
jetzt         -0.54
όμως          -0.53
fuckin        -0.52
Positive Logits
Token         Logit
it            0.71
"}";          0.71
sizeCache     0.70
recent        0.66
dibles        0.65
"");          0.64
he            0.63
The           0.61
—             0.61
IBLES         0.60
Activations Density 0.424%
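
The density figure above can be read as the share of token positions on which the feature fires. The page itself does not define the metric, so the following is only a minimal sketch under that assumption; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active (activation strictly greater than the threshold).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 token positions, 4 of them active.
sample = [0.0] * 996 + [1.3, 0.2, 2.1, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.424% would mean the feature is active on roughly 4 of every 1000 sampled tokens.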