INDEX
Explanations
Phrases related to comparisons between different entities or concepts
Mentions of food-related research and its implications
Negative Logits
Token   Logit
)?      -0.74
)'      -0.56
)*      -0.56
?),     -0.56
?'      -0.55
!),     -0.54
*)      -0.52
)!      -0.52
-)      -0.51
)"      -0.51
Positive Logits
Token          Logit
frequency      0.44
instead        0.43
"#             0.43
DonaldTrump    0.41
behalf         0.40
olicited       0.40
Gateway        0.39
ocamp          0.39
itled          0.38
interstitial   0.38
Activations Density: 2.496%