INDEX
Explanations
expressions of surprise or shock
Negative Logits
NSCoder            -0.70
Италијани          -0.68
EndGlobalSection   -0.68
للاسماء            -0.67
出版年              -0.63
ویکیآمباردا        -0.63
الحره              -0.62
otomatig           -0.60
Tikang             -0.57
Hentet             -0.57
Positive Logits
smiled     0.61
grinned    0.48
stared     0.48
sighed     0.45
nodded     0.43
smirked    0.41
glanced    0.40
shrugged   0.39
frowned    0.39
smile      0.39
Activations Density 0.385%
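
The page does not define how the density figure is computed. As a rough sketch, assuming it is the fraction of sampled token positions on which the feature fires (activation above zero), it could be calculated as below. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and only illustrate the idea.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means share of positions where the feature is active;
    the source page does not state its exact definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 2000 token positions, 8 of them with a nonzero activation.
sample = [0.0] * 1992 + [1.3, 0.2, 4.1, 0.7, 2.2, 0.9, 3.0, 0.1]
print(f"{activation_density(sample):.3%}")  # prints 0.400%
```

Under this reading, a density well below 1% means the feature is sparse: it is inactive on the vast majority of tokens.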