INDEX
Explanations
names of individuals relating to academia and research
Negative Logits
alam       -0.90
GROUND     -0.82
RECT       -0.79
cipline    -0.74
keeper     -0.73
arers      -0.73
unal       -0.73
taboola    -0.73
wark       -0.72
AKING      -0.72
Positive Logits
imus       1.37
imil       1.31
imize      1.06
Payne      1.02
imal       1.00
Weber      0.97
itar       0.96
imen       0.94
imens      0.94
Scher      0.91
Activations Density 7.580%