INDEX
Explanations
specific entities or facts
various verbs or actions that are significant in context
Negative Logits
Token        Logit
verbs        -0.45
talk         -0.45
Barkley      -0.44
Engineers    -0.44
Ukip         -0.42
zai          -0.42
Cards        -0.42
Kee          -0.41
Indie        -0.41
Ki           -0.41
Positive Logits
Token        Logit
ufact        0.63
uled         0.56
Nap          0.55
arius        0.54
heimer       0.54
leep         0.54
verage       0.53
anche        0.53
entially     0.53
ouble        0.52
Activations Density: 1.509%