INDEX
Explanations
phrases related to novel concepts or new initiatives
instances of the word "idea."
New Auto-Interp
Negative Logits
Ago
-0.74
ndum
-0.72
Peaks
-0.66
lake
-0.66
ILCS
-0.65
eworthy
-0.63
gar
-0.60
east
-0.60
Reporting
-0.59
lee
-0.59
POSITIVE LOGITS
ually
1.08
idea
0.82
atical
0.81
moot
0.81
yout
0.78
@#&
0.73
atics
0.73
uitive
0.73
SourceFile
0.71
ual
0.70
Activations Density 0.030%