INDEX
Explanations
references to educational institutions and individuals associated with them
references to Johns Hopkins University and its associated individuals
New Auto-Interp
Negative Logits
vice
-0.71
Helpful
-0.69
minent
-0.67
Cind
-0.65
zza
-0.65
regate
-0.64
Manit
-0.62
bernatorial
-0.61
tti
-0.60
razil
-0.59
POSITIVE LOGITS
kins
1.18
Hopkins
1.13
kinson
1.13
hip
1.02
ullivan
0.92
ELF
0.90
edge
0.87
haw
0.87
sein
0.80
CRIP
0.78
Activations Density 0.058%