INDEX
Explanations
website URLs or numerical patterns
numerical identifiers or codes, particularly in a structured data context
New Auto-Interp
Negative Logits
Sav
-0.82
Sim
-0.78
Malt
-0.76
Manziel
-0.74
Virgin
-0.74
Isa
-0.72
Rand
-0.70
whisky
-0.69
Madagascar
-0.69
isman
-0.69
POSITIVE LOGITS
303
1.94
404
1.88
204
1.86
202
1.85
604
1.84
203
1.83
302
1.81
403
1.79
503
1.77
402
1.74
Activations Density 0.047%