INDEX
Explanations
words related to importance or emphasis
the presence of the word "be" in various forms and contexts
New Auto-Interp
Negative Logits
hoops
-0.69
Blocks
-0.61
Sr
-0.60
accommodation
-0.60
Kurd
-0.59
Kamp
-0.59
Orbit
-0.58
orbit
-0.58
impossibility
-0.58
hops
-0.58
POSITIVE LOGITS
hemoth
1.47
arers
1.42
fitting
1.34
ardless
1.32
heading
1.31
ige
1.31
league
1.29
acons
1.28
eping
1.23
agle
1.23
Activations Density 0.034%