INDEX
Explanations
words related to physical structures and their condition or status
New Auto-Interp
Negative Logits
ixel
-0.15
Carthy
-0.14
cramped
-0.14
opak
-0.14
订
-0.14
AssemblyTitle
-0.13
avax
-0.13
ATRIX
-0.13
Ledger
-0.13
249
-0.13
POSITIVE LOGITS
abandoned
0.58
abandonment
0.53
abandon
0.53
deserted
0.38
fors
0.35
decay
0.35
andoned
0.35
abandoning
0.35
dil
0.31
å¼ĥ
0.31
Activations Density 0.203%