INDEX
Explanations
expressions of surprise or irony in narratives
New Auto-Interp
Negative Logits
koľvek
-0.53
Addo
-0.46
rrggbb
-0.44
setw
-0.42
añ
-0.41
Generals
-0.41
ктей
-0.41
肯定是
-0.41
bArr
-0.40
General
-0.39
POSITIVE LOGITS
oddly
0.96
]--;
0.94
strangely
0.93
ironically
0.93
contentLoaded
0.89
Surprisingly
0.86
weirdly
0.84
surprisingly
0.83
parado
0.81
竟
0.81
Activations Density 0.237%