INDEX
Explanations
references to specific years and dates
references to notable events or individuals associated with historical context
New Auto-Interp
Negative Logits
Ari
-0.93
Nid
-0.82
CLE
-0.79
VG
-0.79
Vu
-0.79
natureconservancy
-0.78
vv
-0.75
Sloven
-0.73
Ari
-0.72
veland
-0.71
POSITIVE LOGITS
Ham
2.62
Ham
2.52
HAM
1.84
ham
1.77
ham
1.70
Hampton
1.68
HAM
1.66
Hammond
1.44
Hamm
1.42
Hammer
1.41
Activations Density 0.216%