INDEX
Explanations
instances of the word "present" or related forms
references to the concept of "present" in various contexts
New Auto-Interp
Negative Logits
STAR
-0.75
weed
-0.67
imov
-0.67
isexual
-0.67
terness
-0.66
jar
-0.66
kick
-0.63
Runs
-0.62
lez
-0.61
BET
-0.61
POSITIVE LOGITS
iment
1.38
iments
1.22
tense
1.09
ational
1.02
imental
1.02
encing
0.96
eering
0.93
able
0.92
invention
0.89
ative
0.86
Activations Density 0.042%