INDEX
Explanations
instances of objects or things being physically described or mentioned
occurrences of various common nouns and phrases related to existence and presence
New Auto-Interp
Negative Logits
enegger
-0.70
doms
-0.64
ctors
-0.63
nesses
-0.62
ocide
-0.60
anwhile
-0.60
}}}
-0.60
ivities
-0.60
=#
-0.59
rin
-0.58
POSITIVE LOGITS
whereby
0.82
indicating
0.74
pertaining
0.74
besides
0.74
outlining
0.73
lately
0.72
devoted
0.71
abouts
0.71
involving
0.69
»Ĵ
0.68
Activations Density 0.395%