INDEX
Explanations
mentions of the existence or presence of people, entities, or things
phrases indicating location or existence of individuals or groups
New Auto-Interp
Negative Logits
OWS
-0.88
advertising
-0.70
ĪĴ
-0.68
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.66
RESULTS
-0.64
omer
-0.62
ionage
-0.61
ãĤ´ãĥ³
-0.61
ãģķ
-0.59
20439
-0.58
POSITIVE LOGITS
nowhere
0.78
abouts
0.73
itialized
0.69
here
0.68
waiting
0.67
clusions
0.65
somewhere
0.65
limbo
0.63
yours
0.62
oln
0.61
Activations Density 0.257%