INDEX
Explanations
adjectives and adverbial forms that describe emotional states or attitudes
New Auto-Interp
Negative Logits
==
-0.16
crow
-0.15
otp
-0.15
–
-0.15
леÑĩ
-0.14
hl
-0.13
zew
-0.13
mil
-0.13
enan
-0.13
ersions
-0.12
POSITIVE LOGITS
iew
0.15
!
0.14
ovit
0.14
rak
0.14
806
0.14
!")
0.14
amac
0.14
?↵↵↵
0.14
!]
0.13
SmartPointer
0.13
Activations Density 0.000%