INDEX
Explanations
instances where something is being represented or portrayed in a particular way
references to the concept of reflection or things that embody reflection
New Auto-Interp
Negative Logits
Klu
-0.65
Hun
-0.65
killer
-0.64
oen
-0.63
Word
-0.62
Mellon
-0.61
Mechan
-0.61
opolis
-0.61
Pul
-0.60
illa
-0.59
POSITIVE LOGITS
reflect
3.68
reflect
2.27
reflects
2.18
reflecting
2.17
Reflect
2.15
reflected
2.08
reflective
2.06
reflection
1.82
reflections
1.45
mirror
1.43
Activations Density 0.008%