Explanations
locations, proper names, and words related to military and physical activities
Negative Logits
ALSE          -0.62
»Ĵ            -0.56
ENDED         -0.56
OLOG          -0.55
Prohibition   -0.55
ACTION        -0.54
OLOGY         -0.54
200000        -0.54
essim         -0.52
ylum          -0.51
Positive Logits
dale          0.73
heed          0.72
utenant       0.65
warm          0.61
vel           0.60
bone          0.59
boat          0.58
leaf          0.57
ãĤ¨ãĥ«        0.56
sey           0.56
Activations Density: 10.900%