INDEX
Explanations
phrases related to specific numerical values
the occurrences of the word "at" in various contexts
Negative Logits
Token      Logit
Lens       -0.77
vous       -0.76
FTWARE     -0.75
PLA        -0.71
Russ       -0.71
birds      -0.70
Winged     -0.66
fill       -0.66
clip       -0.64
lish       -0.64
Positive Logits
Token      Logit
least      1.23
abase      1.08
onement    0.98
rial       0.89
roph       0.84
oned       0.83
dusk       0.82
intervals  0.81
rophic     0.80
omic       0.80
Activations Density 0.191%
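
As a rough illustration of the density figure above, the sketch below assumes that "activations density" means the fraction of token positions on which the feature fires (activation above a threshold). The function name, the threshold default, and the sample data are hypothetical and not taken from the dashboard; the dashboard may compute the statistic differently.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: density = (# active positions) / (# positions sampled).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 2 of 1000 token positions.
sample = [0.0] * 998 + [0.5, 1.2]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.200%
```

Under this reading, a density of 0.191% would correspond to the feature being active on roughly 2 of every 1000 sampled tokens.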