INDEX
Explanations
words related to physical objects, specifically plates
references to license plates
New Auto-Interp
Negative Logits
̶
-0.72
edy
-0.67
nell
-0.66
Finn
-0.63
Survivors
-0.62
Advis
-0.62
inki
-0.62
erness
-0.61
=-=-=-=-
-0.61
Colleges
-0.60
POSITIVE LOGITS
plate
4.13
plates
3.20
plate
3.06
Plate
2.80
plates
2.25
plaque
1.29
mound
1.21
dish
1.17
mantle
1.06
tray
1.00
Activations Density 0.020%