INDEX
Explanations
strong expressions of emotion and enthusiasm in responses
New Auto-Interp
Negative Logits
незавершена
-1.20
nakalista
-1.16
صوتيه
-1.06
ⓧ
-1.04
AccessorTable
-1.02
Roskov
-1.02
estekak
-1.01
виправивши
-0.99
Normdatei
-0.98
twimg
-0.97
POSITIVE LOGITS
0.56
0.46
<eos>
0.44
http
0.44
it
0.43
_
0.43
↵↵
0.42
[
0.42
—
0.40
…
0.40
Activations Density 0.280%