INDEX
Explanations
concepts related to processes of facilitation, modification, and termination
Comes before prepositions/determiners
ending or purpose
New Auto-Interp
Negative Logits
ness
-0.89
ings
-0.80
iness
-0.80
est
-0.73
c
-0.64
-
-0.63
lec
-0.62
room
-0.61
ish
-0.60
ể
-0.60
POSITIVE LOGITS
myſelf
1.38
itſelf
1.28
themſelves
1.18
himſelf
1.08
ſeveral
1.05
pleaſure
1.03
purpoſe
1.02
againſt
1.01
ALLY
0.98
―――――
0.97
Activations Density 0.692%