INDEX
Explanations
mentions of readers or readership
references to the reader or audience
New Auto-Interp
Negative Logits
rament
-0.86
ering
-0.85
corrid
-0.74
eton
-0.71
remlin
-0.68
ela
-0.67
apes
-0.66
equality
-0.66
gypt
-0.65
apeake
-0.65
POSITIVE LOGITS
boys
0.84
Supported
0.79
Reader
0.79
extraord
0.78
="#
0.75
boy
0.75
idge
0.72
gate
0.72
APPLIC
0.71
sonian
0.70
Activations Density 0.033%