INDEX
Explanations
proper nouns related to individuals
proper nouns or names, particularly those related to individuals or organizations
New Auto-Interp
Negative Logits
OLOGY
-0.76
ACTED
-0.76
advertisement
-0.68
institutional
-0.65
systematic
-0.64
achusetts
-0.64
itutional
-0.64
eanor
-0.64
OLOG
-0.64
ocrats
-0.63
POSITIVE LOGITS
Kur
1.22
istani
1.07
ernel
1.00
thur
0.93
assic
0.90
rier
0.88
geon
0.87
Zar
0.85
immune
0.84
ihara
0.83
Activations Density 0.005%