INDEX
Explanations
prepositions and phrases indicating positions or locations
phrases indicating spatial or positional contexts
New Auto-Interp
Negative Logits
cred
-0.64
assumes
-0.63
awaru
-0.62
alloc
-0.61
thereafter
-0.60
arising
-0.59
Attempts
-0.58
IRE
-0.58
subsequently
-0.58
allocations
-0.58
POSITIVE LOGITS
acebook
0.86
itable
0.84
lined
0.81
omorphic
0.80
uba
0.77
ited
0.77
ped
0.76
fed
0.76
active
0.76
pport
0.75
Activations Density 0.231%