INDEX
Explanations
instances of the word "but" indicating contrasts or exceptions
New Auto-Interp
Negative Logits
ConstraintMaker
-0.65
LookAnd
-0.60
ьаж
-0.58
brainly
-0.56
下载附件
-0.56
erializer
-0.52
Valentina
-0.51
itarianism
-0.51
Rohan
-0.50
ahon
-0.50
POSITIVE LOGITS
numerusform
0.66
KommentareTeilen
0.65
Monfieur
0.65
клопе
0.62
říklad
0.59
ſever
0.59
internetowa
0.57
Conſ
0.57
خارجية
0.57
liferay
0.57
Activations Density 0.008%