INDEX
Explanations
mentions of particular organizations or events
percentage values or numeric data within the text
New Auto-Interp
Negative Logits
Morales
-0.70
Melvin
-0.69
loo
-0.66
Deity
-0.65
Watson
-0.62
Jacobs
-0.61
Swanson
-0.60
blinded
-0.59
Hurt
-0.59
inciner
-0.58
POSITIVE LOGITS
window
1.11
Loading
1.07
RAW
0.86
Topics
0.84
asters
0.84
âĵĺ
0.80
Posted
0.78
âĶĢâĶĢâĶĢâĶĢ
0.75
Trivia
0.75
clus
0.75
Activations Density 0.244%