INDEX
Explanations
terms related to different types or categories within a specific domain
New Auto-Interp
Negative Logits
esses
-0.86
eks
-0.74
ĸļ
-0.74
lest
-0.73
oult
-0.72
lists
-0.71
Õ
-0.70
erves
-0.70
ellen
-0.70
Ô
-0.68
POSITIVE LOGITS
activity
0.84
equipment
0.83
measurement
0.81
firearm
0.81
person
0.81
clothing
0.81
weaponry
0.80
intermediate
0.78
medicine
0.78
behavior
0.78
Activations Density 0.071%