Explanations
prepositions followed by specific nouns or modifiers
Negative Logits
Token               Logit
comed               -0.67
regards             -0.65
NS                  -0.58
KK                  -0.58
natureconservancy   -0.56
wcs                 -0.55
reassuring          -0.53
NW                  -0.53
uilt                -0.53
Jae                 -0.52
Positive Logits
Token               Logit
orem                0.74
iciency             0.69
eat                 0.68
choice              0.66
rage                0.65
gression            0.63
speech              0.63
parency             0.61
Lear                0.61
Excellence          0.60
Activations Density 0.135%