INDEX
Explanations
mentions of specific scenarios or instances of something in documents
references to specific instances or occurrences of events
New Auto-Interp
Negative Logits
roe
-0.72
obil
-0.71
streng
-0.71
livest
-0.69
redit
-0.65
owered
-0.65
ACTED
-0.64
rica
-0.64
rote
-0.64
pez
-0.63
POSITIVE LOGITS
cases
1.35
cases
1.21
case
0.91
Cases
0.90
instances
0.89
backs
0.77
paces
0.77
rooms
0.76
Case
0.75
Canaver
0.74
Activations Density 0.013%