Explanations
the word "re" followed by a high level of activation
the word "we" in various contexts, indicating a collective perspective or shared experiences
Negative Logits
lapt                       -0.80
BuyableInstoreAndOnline    -0.69
owl                        -0.63
Seah                       -0.62
Pebble                     -0.62
freezes                    -0.61
simulator                  -0.61
layer                      -0.60
deterrent                  -0.59
Bernstein                  -0.59
Positive Logits
ngth        1.05
aper        1.03
apers       0.99
selves      0.85
becca       0.83
versible    0.83
claimer     0.82
acters      0.80
psons       0.79
ggie        0.78
Activations Density: 0.037%