INDEX
Explanations
words related to actions or states of being, focusing on the verb form
verbs indicating actions or states of being
New Auto-Interp
Negative Logits
COURT
-0.74
Palestin
-0.65
suspic
-0.64
elligence
-0.63
DISTRICT
-0.63
avorite
-0.63
peak
-0.61
mosqu
-0.60
stride
-0.58
deadliest
-0.57
POSITIVE LOGITS
able
1.01
ings
0.94
eth
0.91
ables
0.90
ingly
0.89
backs
0.83
ties
0.82
ances
0.82
zeb
0.80
runs
0.77
Activations Density 0.381%