INDEX
Explanations
phrases indicating authorship or attribution
from sender or editor
New Auto-Interp
Negative Logits
AddTagHelper
-0.46
przecież
-0.45
neſs
-0.44
canst
-0.44
tidaknya
-0.43
ußt
-0.42
ſelves
-0.42
Chwiliwch
-0.40
otheek
-0.39
læng
-0.39
POSITIVE LOGITS
AssemblyCompany
0.68
enumi
0.60
帖最后由
0.59
Frank
0.52
AssemblyProduct
0.50
enumii
0.50
labelledby
0.50
المعيارى
0.49
Thorn
0.48
horn
0.48
Activations Density 0.007%