INDEX
Explanations
phrases indicating authorship or leadership in a context
New Auto-Interp
Negative Logits
IsContent
-0.77
pleaſure
-0.74
tagHelper
-0.66
purpoſe
-0.62
ſche
-0.59
faſt
-0.59
mità
-0.58
riwal
-0.57
diſt
-0.56
houſe
-0.55
POSITIVE LOGITS
حوالہ
0.72
geslacht
0.68
Emeritus
0.65
oleh
0.62
emeritus
0.62
kasarigan
0.61
qualified
0.61
former
0.60
Lähteet
0.59
by
0.56
Activations Density 0.375%