INDEX
Explanations
references to specific locations or 'spots' within the text
New Auto-Interp
Negative Logits
íncia
-0.75
Maier
-0.74
קישורים
-0.69
egis
-0.67
Kenner
-0.66
ölkerung
-0.65
weetened
-0.64
Hydra
-0.64
Ayres
-0.64
مشين
-0.64
POSITIVE LOGITS
spot
3.58
Spot
3.44
spot
3.23
Spot
3.19
SPOT
3.16
spots
2.97
Spots
2.89
spots
2.80
SPOT
2.80
Spots
2.46
Activations Density 0.035%