INDEX
Explanations
instances related to people and their actions in various scenarios, such as traveling, playing, living, and being involved in legal matters
references to people, migration, or events involving groups and significant actions
New Auto-Interp
Negative Logits
`.
-0.76
ESE
-0.72
":-
-0.65
ECA
-0.62
>[
-0.60
enne
-0.59
retty
-0.58
/-
-0.57
DL
-0.57
','
-0.57
POSITIVE LOGITS
has
1.06
appears
1.02
disappears
1.02
was
0.99
expires
0.97
violates
0.97
enters
0.97
receives
0.95
survives
0.93
earns
0.92
Activations Density 0.591%