INDEX
Explanations
proper nouns or names preceded by the word "Said"
reported speech or quotations
New Auto-Interp
Negative Logits
ä¿
-0.78
RAY
-0.71
ç¥ŀ
-0.71
æł
-0.70
istration
-0.70
illation
-0.67
ç«
-0.67
ä½ľ
-0.65
оÐ
-0.65
andering
-0.65
POSITIVE LOGITS
Ones
1.04
Doesn
1.01
Called
1.00
Alive
0.97
Them
0.96
Cause
0.96
Wrong
0.95
Makes
0.95
Difference
0.93
Gets
0.90
Activations Density 0.067%