INDEX
Explanations
phrases related to news and events
references to specific people, events, or conditions in a narrative context
New Auto-Interp
Negative Logits
voluntary
-0.50
hazard
-0.48
neg
-0.48
violation
-0.47
relinqu
-0.46
savings
-0.45
fooled
-0.45
voluntarily
-0.45
seizure
-0.45
potential
-0.45
POSITIVE LOGITS
âĦ¢:
0.67
ï¸ı
0.57
Elsewhere
0.55
0.55
rawdownloadcloneembedreportprint
0.52
Í
0.52
Scroll
0.50
ta
0.50
Flavoring
0.49
Across
0.48
Activations Density 1.194%