Perception of images and videos is affected by the constraints of the physical systems used for acquisition and display, and by degradations introduced during storage (compression) and streaming (transmission). Perception also depends on the characteristics of the human visual system. All of these factors, which contribute to the perception of a visual stimulus, are combined under a single pipeline that we denote as Pixels to Perception (P2P). The P2P pipeline starts with capturing an image and includes its storage on digital platforms and its transfer to other devices. Eventually, the multimedia content is perceived by end users when it is displayed. At P2P, we represent and analyze perception by assessing the subjective quality of both videos and images and by modeling objective quality estimators.