INDEX
Explanations
terms related to figures, measurements, and forms
New Auto-Interp
Negative Logits
stagnant
-0.62
jumper
-0.60
gorilla
-0.59
quar
-0.59
Constructed
-0.58
tightly
-0.57
Uran
-0.56
attendant
-0.56
universes
-0.55
nomine
-0.55
POSITIVE LOGITS
hiba
0.95
oshenko
0.87
ush
0.82
aret
0.78
eless
0.77
ago
0.77
ias
0.76
oan
0.75
ogo
0.75
ronic
0.74
Activations Density 0.021%