With advancements in hardware and the rapid development of the internet, the clarity of surveillance videos has significantly improved, allowing people to quickly understand incidents in a specific location without being physically present. This is especially crucial in maritime navigation, where there is a lack of the solid ground feeling of safety. High-definition surveillance videos transmitted from the sea become a medium for land-based entities to ensure maritime safety. However, due to the absence of base stations at sea, these videos must be transmitted via satellite communication, which is limited by bandwidth and efficiency. Under normal weather conditions, the upload speed for a movie is generally below 200kb/s, and even lower during bad weather, meaning it could take a whole day to transmit a two-hour movie. To rapidly transmit surveillance videos without changing hardware, the data is compressed during transmission, such as by reducing the resolution. However, there is a high demand for the clarity of these images or videos, necessitating the use of image super-resolution techniques at the receiving end to enhance resolution and reconstruct the original high-definition images or videos as closely as possible.To minimize the loss of video information or even eliminate it, a method based on video super-resolution using the SWIN-ESR network is proposed to restore the original video, achieving the goal of transmission. This paper will compare SWIN-ESR with SRCNN, SRGAN, ESRGAN, and Real-ESRGAN, and evaluate the models through a series of performance metrics. The experimental results demonstrate that SWIN-ESR performs well in various numerical aspects.