Explanations
instances of the word "of" followed by another word or phrase
phrases or terms that include the word "of."
Negative Logits
Token       Logit
zer         -0.64
partName    -0.56
iaries      -0.56
krit        -0.56
zers        -0.54
beam        -0.54
ynes        -0.54
zee         -0.53
violates    -0.53
agher       -0.53
Positive Logits
Token       Logit
course      1.85
sorts       1.72
course      1.44
theirs      1.24
ours        1.18
hers        1.18
mine        1.09
yours       1.07
COUR        1.06
Course      1.03
Activations Density 0.252%
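
To make the density figure concrete, here is a minimal sketch of how such a number could be computed. It assumes that "activation density" means the fraction of token positions on which the feature's activation is above zero; the page itself does not define the term. The function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical and not taken from the tool that produced this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: density = (number of active positions) / (total positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 100,000 token positions, 252 of which activate the feature.
toy_activations = [0.0] * 99_748 + [1.0] * 252

density = activation_density(toy_activations)
print(f"{density:.3%}")  # prints 0.252%
```

Under this assumption, a density of 0.252% corresponds to roughly 252 active positions per 100,000 tokens, which would make this a sparse feature.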