INDEX
Explanations
names with hyphens
proper names, particularly those of people
New Auto-Interp
Negative Logits
indic
-0.68
headlines
-0.67
clutch
-0.65
counties
-0.65
Republicans
-0.65
airports
-0.64
HT
-0.64
OC
-0.63
Slip
-0.63
Raider
-0.63
POSITIVE LOGITS
Pierre
1.59
Cla
1.58
Louis
1.48
Marie
1.44
Fran
1.44
Paul
1.39
Marc
1.39
Luc
1.38
Philipp
1.38
Georg
1.37
Activations Density 0.035%