INDEX
Explanations
the word "happen" or variations of it in phrases describing future events
instances of the word "happen" and its variants to indicate events or occurrences
New Auto-Interp
Negative Logits
rador
-0.83
ilts
-0.74
edged
-0.70
urdy
-0.69
oyal
-0.68
Flavoring
-0.68
olded
-0.68
grown
-0.66
cius
-0.66
ravings
-0.65
POSITIVE LOGITS
uate
0.89
everywhere
0.80
NetMessage
0.75
anywhere
0.73
Tues
0.72
uates
0.71
anytime
0.71
happening
0.70
happen
0.70
Thurs
0.67
Activations Density 0.041%