INDEX
Explanations
references to social interactions and events involving people
New Auto-Interp
Negative Logits
ENUM
-0.15
105
-0.14
onaut
-0.13
orrh
-0.13
foresee
-0.13
adlo
-0.13
iento
-0.13
Fly
-0.12
ãģ£ãģ±
-0.12
åĵŃ
-0.12
POSITIVE LOGITS
introduced
0.34
introdu
0.34
introduce
0.33
introduction
0.31
introducing
0.29
Introduced
0.29
introduces
0.28
approach
0.26
greeting
0.25
shake
0.24
Activations Density 0.425%