INDEX
Explanations
the followed by abstract nouns
New Auto-Interp
Negative Logits
VIS
0.41
Determin
0.39
真實
0.39
რამდ
0.39
真实
0.38
šia
0.38
радиа
0.38
bárm
0.38
whats
0.37
نوشت
0.37
POSITIVE LOGITS
havoc
0.74
impression
0.70
sacrifices
0.63
compromises
0.63
impressions
0.62
manner
0.62
way
0.61
battles
0.60
lengths
0.59
inroads
0.59
Activations Density 0.028%