INDEX
Explanations
references to locations and time periods
instances of emphasis or concern in statements
New Auto-Interp
Negative Logits
çīĪ
-0.74
cffffcc
-0.69
ãĤ´ãĥ³
-0.69
emi
-0.64
chieve
-0.64
undrum
-0.63
reportedly
-0.63
APTER
-0.62
commented
-0.61
confir
-0.61
POSITIVE LOGITS
"â̦
0.86
nobody
0.78
there
0.78
"'
0.77
they
0.77
"[
0.74
they
0.71
there
0.71
terrorists
0.70
attackers
0.70
Activations Density 0.384%