INDEX
Explanations
The Japanese particle "を" (wo), which marks the direct object of a verb
Negative Logits
Token        Logit
purpoſe      -1.02
houſe        -0.95
Jefus        -0.94
itſelf       -0.92
pleaſure     -0.92
Chriftian    -0.91
myſelf       -0.89
Chriſt       -0.89
Monfieur     -0.88
iſt          -0.87
Positive Logits
Token    Logit
を       2.55
를       2.00
을       1.93
を       1.59
를       1.38
いを     1.36
을       1.35
子を     1.32
りを     1.30
曲を     1.20
Activations Density: 0.018%