Explanations
references to worshippers
words related to worshipers or worship
Negative Logits
Token     Logit
stage     -0.76
Sense     -0.68
Zah       -0.64
Stage     -0.63
inst      -0.63
stack     -0.61
HL        -0.60
Yar       -0.60
ethe      -0.60
stage     -0.59
Positive Logits
Token     Logit
ippers    4.38
ipper     2.87
ipping    2.00
ipped     1.53
ipp       1.52
appers    1.46
ips       1.41
ippy      1.40
IPP       1.20
IPS       1.19
Activations Density: 0.005%