INDEX
Explanations
specific scientific or biological terms related to entities or concepts, particularly those involving acronyms and notations
Two or three letter acronyms
secret codes
Negative Logits
Token    Logit
n        -1.15
r        -1.02
k        -1.00
l        -0.99
m        -0.98
t        -0.95
ng       -0.84
b        -0.83
lin      -0.80
d        -0.78
Positive Logits
Token    Logit
poffe    1.01
elfare   1.00
posedge  0.99
Jefus    0.98
wiſe     0.97
raiſ     0.97
annica   0.96
myſelf   0.95
łucha    0.94
LDA      0.94
Activations Density: 1.001%