INDEX
Explanations
expressions of surprise or emphasis, particularly variations of "oh."
New Auto-Interp
Negative Logits
faſt
-0.69
againſt
-0.62
Theſe
-0.60
abstrait
-0.59
يتيمه
-0.58
AssemblyTitle
-0.58
dezelve
-0.57
raiſ
-0.57
tričko
-0.57
eventName
-0.56
POSITIVE LOGITS
oh
0.82
oh
0.70
OH
0.62
OH
0.62
dex
0.59
Oh
0.58
je
0.57
Jiang
0.55
ust
0.54
Oh
0.53
Activations Density 0.296%