Explanations
individuals and their contributions in various contexts, particularly in education and science
Negative Logits
compr         -0.87
coincide      -0.71
menace        -0.69
disparate     -0.69
fallout       -0.68
chop          -0.68
proport       -0.67
livest        -0.66
turbulence    -0.66
camera        -0.66
Positive Logits
Markus        0.96
Claus         0.96
Jen           0.94
Alvin         0.93
Edward        0.92
William       0.91
Daniel        0.90
Robert        0.90
Edwin         0.88
Fred          0.87
Activations Density 0.027%