INDEX
Explanations
adjectives expressing strong emotions or opinions
expressions of disbelief or incredulity
New Auto-Interp
Negative Logits
ascript
-0.89
onga
-0.73
ãĤ¼ãĤ¦ãĤ¹
-0.73
restling
-0.72
mentation
-0.71
HAEL
-0.71
igion
-0.71
atson
-0.68
20439
-0.67
inion
-0.66
POSITIVE LOGITS
bet
0.75
bookmark
0.71
innocuous
0.69
contradiction
0.65
favorably
0.63
paren
0.63
Zeal
0.62
quaint
0.62
obligated
0.60
uously
0.60
Activations Density 0.130%