INDEX
Explanations
expressions of excitement or enthusiasm related to performances and events
New Auto-Interp
Negative Logits
**********/
-0.75
Demikian
-0.71
}?>
-0.71
terrific
-0.70
")");
-0.69
Ανακτήθηκε
-0.69
Herzliche
-0.67
прочем
-0.64
daß
-0.64
AFX
-0.63
POSITIVE LOGITS
like
1.18
kind
1.02
kinda
0.96
yeah
0.84
Like
0.81
sort
0.80
—
0.80
Like
0.78
sorta
0.78
kind
0.76
Activations Density 0.222%