INDEX
Explanations
phrases that indicate alternatives or choices
Negative Logits
Token         Logit
itſelf        -0.94
―――――         -0.94
Houſe         -0.85
་་            -0.84
Shakspeare    -0.83
Anſ           -0.81
Majefty       -0.80
Jefus         -0.79
Weyl          -0.79
himſelf       -0.78
Positive Logits
Token         Logit
or            1.08
Or            1.06
otherwise     1.00
other         0.99
else          0.98
perhaps       0.95
oder          0.94
atau          0.93
یا            0.93
OR            0.92
Activations Density: 0.219%