INDEX
Explanations
json structures with proxies
New Auto-Interp
Negative Logits
भूमिका
0.42
smoothing
0.39
емо
0.38
masculino
0.37
بە
0.37
Looking
0.37
embodying
0.37
Smoothing
0.37
मोबाइल
0.36
Gene
0.35
POSITIVE LOGITS
prox
0.50
proxies
0.44
CAC
0.44
metac
0.44
clamp
0.43
oprop
0.42
clamps
0.41
dict
0.41
proxy
0.40
foc
0.39
Activations Density 0.006%