INDEX
Explanations
specific mentions of individuals, organizations, agreements, or actions
statements or claims attributed to various authorities and reports
New Auto-Interp
Negative Logits
ependence
-0.89
erella
-0.75
izont
-0.73
Marginal
-0.73
eways
-0.68
livion
-0.68
inge
-0.66
Merit
-0.66
oliath
-0.65
emin
-0.63
POSITIVE LOGITS
"â̦
0.94
"[
0.89
"#
0.86
"@
0.83
"'
0.78
"...
0.76
Paddock
0.69
:[
0.66
spies
0.66
fabricated
0.66
Activations Density 0.537%