INDEX
Explanations
expressions of positive sentiment and approval
words of strong approval
New Auto-Interp
Negative Logits
__":
-0.57
esgue
-0.51
noDo
-0.50
FetchType
-0.48
RectangleBorder
-0.48
edance
-0.47
Rhestr
-0.46
semble
-0.45
kaŭ
-0.44
byshev
-0.43
POSITIVE LOGITS
yesterday
0.44
occaf
0.43
pleaſure
0.43
duquel
0.42
UnusedPrivate
0.42
Yesterday
0.42
anſ
0.40
createStatement
0.39
raiſ
0.39
régal
0.39
Activations Density 0.013%