INDEX
Explanations
terms related to legal or political issues
sentences or phrases that introduce additional information or elaboration
New Auto-Interp
Negative Logits
troubles
-0.71
roit
-0.68
entle
-0.67
leased
-0.64
ities
-0.62
itiz
-0.61
Merit
-0.60
acity
-0.60
undai
-0.60
avez
-0.59
POSITIVE LOGITS
namely
1.07
Provided
1.01
↵Âł
0.81
http
0.81
https
0.79
Whereas
0.71
Firstly
0.70
Journals
0.69
³³³³³³³³³³³³³³³³
0.67
"â̦
0.67
Activations Density 0.111%