INDEX
Explanations
instances of the word "Inspire" or its variants, emphasizing themes of motivation and encouragement
New Auto-Interp
Negative Logits
liness
-0.15
umberland
-0.15
attles
-0.15
attle
-0.15
agers
-0.15
INGTON
-0.14
ickey
-0.14
able
-0.14
onomic
-0.14
hv
-0.14
POSITIVE LOGITS
iring
0.34
ired
0.33
ire
0.31
iration
0.30
ires
0.26
ite
0.25
irus
0.22
pired
0.22
irit
0.21
ITE
0.21
Activations Density 0.005%