INDEX
Explanations
phrases that express uncertainty or conditionality in statements
New Auto-Interp
Negative Logits
betweenstory
-1.00
ſelf
-0.86
myſelf
-0.86
himſelf
-0.79
Jefus
-0.79
μως
-0.79
Efq
-0.79
Monfieur
-0.77
themſelves
-0.76
ftate
-0.74
POSITIVE LOGITS
perhaps
0.79
שוליים
0.74
たまた
0.71
many
0.69
although
0.66
Hozzáférés
0.65
unlike
0.65
also
0.62
even
0.60
ultimately
0.60
Activations Density 4.746%