INDEX
Explanations
references to "platinum" in various contexts
New Auto-Interp
Negative Logits
eering
-1.01
eers
-0.80
alter
-0.76
LER
-0.72
rir
-0.72
alez
-0.70
hani
-0.68
eller
-0.67
ellar
-0.66
arist
-0.66
POSITIVE LOGITS
atinum
1.18
platinum
0.88
pudding
0.82
Platinum
0.79
dioxide
0.74
Diamond
0.71
Edition
0.70
odies
0.69
plaque
0.67
tier
0.65
Activations Density 0.005%