INDEX
Explanations
numerical values followed by words
numerical values and their occurrences within text
New Auto-Interp
Negative Logits
helicop
-0.64
Niet
-0.63
Azerb
-0.62
iterranean
-0.60
Seym
-0.60
edIn
-0.60
Category
-0.59
Borders
-0.58
(*
-0.58
disadvant
-0.57
POSITIVE LOGITS
][
0.77
]
0.65
)]
0.60
Falcon
0.60
)
0.59
azes
0.59
].
0.59
));
0.58
);
0.57
)))
0.57
Activations Density 0.059%