INDEX
Explanations
expressions of surprise or exclamation
New Auto-Interp
Negative Logits
myſelf
-1.07
Efq
-0.95
themſelves
-0.88
houſe
-0.88
itſelf
-0.86
himſelf
-0.85
Majefty
-0.83
cdti
-0.82
useRouter
-0.80
againſt
-0.77
POSITIVE LOGITS
Oh
1.06
Oh
1.02
oh
0.91
oh
0.87
sweet
0.78
OH
0.76
sweet
0.68
мәкал
0.68
OH
0.67
Ohh
0.66
Activations Density 0.037%