INDEX
Explanations
phrases indicating causal relationships or results
as a result of this
New Auto-Interp
Negative Logits
pleaſure
-0.62
purpoſe
-0.61
itſelf
-0.53
ſtand
-0.53
houſe
-0.52
myſelf
-0.52
ſche
-0.51
faſt
-0.48
Chriftian
-0.48
beſt
-0.48
POSITIVE LOGITS
результате
0.67
urma
0.63
akibat
0.63
infolge
0.63
rzez
0.60
نتيجة
0.60
resourceCulture
0.58
seguito
0.56
karena
0.56
ibatkan
0.55
Activations Density 0.026%