INDEX
Explanations
phrases involving possessive pronouns or contractions related to ownership or association
New Auto-Interp
Negative Logits
non
-0.83
long
-0.75
"
-0.75
set
-0.75
so
-0.74
real
-0.73
far
-0.73
ronom
-0.72
fine
-0.71
mặt
-0.71
POSITIVE LOGITS
ſever
0.92
itſelf
0.90
houſe
0.89
مرئيه
0.87
greateſt
0.87
myſelf
0.86
Majefty
0.86
ſche
0.86
pleaſure
0.85
Houſe
0.84
Activations Density 0.033%