INDEX
Explanations
proper nouns or specific names of entities
occurrences of the phrase "are" and "were" in various contexts
New Auto-Interp
Negative Logits
lapse
-0.75
diseng
-0.69
onding
-0.67
century
-0.66
bis
-0.65
guiActiveUnfocused
-0.65
operation
-0.62
process
-0.61
transition
-0.60
regulate
-0.60
POSITIVE LOGITS
*:
0.83
staples
0.71
ATHER
0.70
Cosponsors
0.69
nods
0.68
finalists
0.68
ngth
0.68
representatives
0.66
namely
0.65
Bey
0.65
Activations Density 0.269%