INDEX
Explanations
adverbs that modify actions or qualities in a descriptive manner
Negative Logits
k          -0.60
B          -0.59
b          -0.58
al         -0.56
an         -0.56
z          -0.56
вица       -0.56
vicente    -0.55
uncios     -0.53
в          -0.52
Positive Logits
AddTagHelper       1.00
BibitemShut        0.86
#                  0.86
']")               0.85
"],                0.84
])]                0.83
SequentialGroup    0.81
sively             0.81
}%                 0.80
"]),               0.80
Activations Density 0.584%