INDEX
Explanations
quantifiable attributes and actions
New Auto-Interp
Negative Logits
ຢູ່ໃນ
0.42
აღმასრულ
0.40
heiz
0.39
నాలను
0.37
గు
0.37
താണ്
0.37
šana
0.37
០០
0.37
ugs
0.36
ളം
0.36
POSITIVE LOGITS
visited
1.13
travelled
1.08
traveled
1.01
consulted
0.99
consumed
0.97
visited
0.96
Visited
0.95
encountered
0.93
eaten
0.91
flown
0.89
Activations Density 0.115%