INDEX
Explanations
specific words related to technology features or specifications
references to types or categories, particularly in a technical or specification context
New Auto-Interp
Negative Logits
romeda
-0.90
olulu
-0.72
utical
-0.71
å§«
-0.70
ordial
-0.70
ITNESS
-0.69
yrinth
-0.68
pton
-0.67
nas
-0.67
ernel
-0.67
POSITIVE LOGITS
faces
1.25
face
1.17
ahead
0.85
casting
0.80
etter
0.78
etting
0.75
Script
0.71
inference
0.68
inker
0.68
oho
0.68
Activations Density 0.018%