INDEX
Explanations
words related to foundational aspects, origins, or primary causes
references to foundational or fundamental concepts, often related to causes or origins
New Auto-Interp
Negative Logits
eers
-0.73
psey
-0.72
hemat
-0.70
disadvant
-0.69
Tonight
-0.67
Dragonbound
-0.66
glers
-0.65
abwe
-0.65
ammy
-0.65
ques
-0.64
POSITIVE LOGITS
kit
1.03
canal
1.00
beer
0.96
stock
0.95
cellar
0.92
stocks
0.90
Canal
0.81
arious
0.81
Cause
0.80
less
0.78
Activations Density 0.040%