INDEX
Explanations
descriptions of physical environments and their features
phrases describing actions or states that involve physical surroundings and interactions
New Auto-Interp
Negative Logits
é¾įåĸļ士
-0.66
ãĥ´ãĤ¡
-0.63
ONSORED
-0.63
NES
-0.62
EVA
-0.61
APTER
-0.61
alion
-0.60
çͰ
-0.58
ERO
-0.58
Ô
-0.57
POSITIVE LOGITS
abound
0.78
everywhere
0.62
prolifer
0.60
populate
0.59
hots
0.59
favour
0.56
themselves
0.56
favor
0.55
plentiful
0.55
roam
0.54
Activations Density 0.993%