INDEX
Explanations
phrases related to legal agreements or regulations
references to international agreements and policies
Negative Logits
souven       -0.71
rehearsal    -0.70
prelim       -0.68
sibling      -0.66
resil        -0.64
ahime        -0.64
crew         -0.64
bung         -0.64
finalists    -0.62
exterior     -0.61
Positive Logits
Professor    1.34
SPONSORED    1.28
Writing      1.17
He           1.15
Prof         1.11
Professor    1.03
His          1.02
Indeed       0.99
Others       0.95
Chomsky      0.94
Activations Density: 0.491%