INDEX
Explanations
abstract concepts or ideas
phrases that begin with "the idea of."
New Auto-Interp
Negative Logits
ndum
-0.68
heast
-0.66
lake
-0.65
Ago
-0.65
ILCS
-0.65
eworthy
-0.65
roads
-0.63
ificant
-0.62
DF
-0.62
major
-0.61
POSITIVE LOGITS
ually
1.21
moot
0.87
idea
0.81
atical
0.81
atics
0.80
urally
0.74
uitive
0.74
matic
0.73
ual
0.71
yout
0.71
Activations Density 0.027%