INDEX
Explanations
descriptions of people's ages
the presence of unique contextual markers or specific formatting
New Auto-Interp
Negative Logits
_.
-0.79
*.
-0.67
destro
-0.66
agre
-0.65
behavi
-0.65
emale
-0.64
behav
-0.64
;)
-0.64
occas
-0.64
(*
-0.63
POSITIVE LOGITS
NASCAR
0.60
Hillary
0.60
Corbyn
0.59
Pokémon
0.58
Houth
0.56
Labour
0.56
Nintendo
0.54
zbollah
0.54
Brewers
0.54
Shin
0.54
Activations Density 1.830%