How to Add Watermark to a PDF File Using Python

Post Views: 1,967

Hey there everyone, Today we are going to learn how to add a watermark to a pdf file using Python.
We will be using the PyPDF2 library of Python which is capable of merging two pdf files.

Add Watermark to a PDF file in Python

We have two pdf files one of which contains only text(can also have images) and the other one contains the watermark to be added.
The basic idea behind this would be merging the two pdf files.

Our watermark file “watermark.pdf” is:
watermark.pdf

Screenshot of the file is below:

add watermark to pdf file in Python

We will be adding the above-mentioned watermark to the pdf file “doc.pdf”:
doc.pdf

After merging the above two pdf files we will get our output file containing the contents of both “watermark.pdf” and “doc.pdf”.

LET’S DISCUSS THE STEPS INVOLVED :

Importing the PyPDF2 module.
```
import PyPDF2
```

Storing the contents of the pdf file and the watermark file.

pdf_file = "doc.pdf"
watermark = "watermark.pdf"
merged_file = "merged.pdf"

Open and Read the pdf file and the watermark file.

input_file = open(pdf_file,'rb')
input_pdf = PyPDF2.PdfFileReader(pdf_file)

watermark_file = open(watermark,'rb')
watermark_pdf = PyPDF2.PdfFileReader(watermark_file)

Accessing the pages of the pdf file and the watermark file to be merged, Index 0 is used to access the first page.
```
pdf_page = input_pdf.getPage(0)

watermark_page = watermark_pdf.getPage(0)
```
Merging the pages.
```
pdf_page.mergePage(watermark_page)
```

Saving our file in the output.

output = PyPDF2.PdfFileWriter()
output.addPage(pdf_page)

The final pdf file after adding the watermark is stored in merged_file.
```
merged_file = open(merged_file,'wb')
output.write(merged_file)
```

closing the files.

merged_file.close()
watermark_file.close()
input_file.close()

Python program to add watermark to pdf

import PyPDF2

pdf_file = "doc.pdf"

watermark = "watermark.pdf"

merged_file = "merged.pdf"

input_file = open(pdf_file,'rb')
input_pdf = PyPDF2.PdfFileReader(input_file)

watermark_file = open(watermark,'rb')
watermark_pdf = PyPDF2.PdfFileReader(watermark_file)

pdf_page = input_pdf.getPage(0)

watermark_page = watermark_pdf.getPage(0)

pdf_page.mergePage(watermark_page)

output = PyPDF2.PdfFileWriter()

output.addPage(pdf_page)

merged_file = open(merged_file,'wb')
output.write(merged_file)

merged_file.close()
watermark_file.close()
input_file.close()

After the successful execution of this code, we will have our output pdf file named “merged.pdf”.
merged.pdf

Screenshot:

watermarking to pdf

17 responses to “How to Add Watermark to a PDF File Using Python”

Shyok Mutsuddi says:

January 1, 2020 at 2:53 pm

Nice explaination!

Reply
- Sushant Shaw says:
  
  January 2, 2020 at 9:20 pm
  
  Thank you:)
  
  Reply
Sulaiman says:

January 7, 2020 at 2:19 am

any idea how to make watermark background transparent?

Reply
- Sushant Shaw says:
  
  January 7, 2020 at 9:28 pm
  
  The transparency of the watermark can be adjusted while creating it.
  
  Reply
  - Jake says:
    
    March 23, 2021 at 1:05 am
    
    how can i do that? Because when I overlay like this code suggest it covers my text
    
    Reply
HeIsRealMagic says:

March 16, 2020 at 9:24 pm

So I am doing this with a 6 page input pdf and want all the pages to have the watermark on them. The loop seems to work as I get the input pdf out but none of the pages have the watermark. The watermark pdf is a blank page with an image covering the whole page with no margins. Any ideas why it would not show up?

Reply
Asif Fazal says:

March 19, 2020 at 5:10 am

The above code can only watermark one page… Here is the improved one with unlimited page compatability..

import PyPDF2

template = PyPDF2.PdfFileReader(open(“inputPDF”, ‘rb’))
watermark = PyPDF2.PdfFileReader(open(“WaterMarkPDF”, ‘rb’))
output = PyPDF2.PdfFileWriter()

for i in range(template.getNumPages()):
page = template.getPage(i)
page.mergePage(watermark.getPage(0))
output.addPage(page)

file = open(“waterMarked_PDF.pdf”, ‘wb’)
output.write(file)

Reply
Sale says:

April 7, 2020 at 12:17 am

input_file = open(pdf_file,’rb’)
input_pdf = PyPDF2.PdfFileReader(pdf_file)
——————————————————————

its input file at reader !

Reply
- Saruque Ahamed Mollick says:
  
  April 7, 2020 at 9:29 pm
  
  Thank you very much. The code has been updated now.
  
  Reply
Harrison says:

April 27, 2020 at 12:00 pm

Is there any way I could add a position for the watermark to be placed?

If for example I was to use a 1200x500px PDF as the watermark, could I tell it to place it over the bottom right corner of the PDF?

Reply
- John doe says:
  
  May 21, 2020 at 7:15 pm
  
  set the position in the watermark.pdf file
  
  Reply
lin says:

August 21, 2020 at 5:54 am

Explained Beautifully, thank you so much

Reply
Not_the_radbrad says:

March 22, 2022 at 6:14 pm

Hi, can you make the same but with a user interface allowing users to select logbook from the Gui

Reply
dan says:

May 6, 2022 at 7:43 am

this doesn’t add any transparency to the watermark, so it will cover the existing text

Reply
niclas says:

August 24, 2022 at 10:20 pm

Helped me a lot, thank you!
Is it possible to also add custom text to the pdfs?
Like Adding additionally a unique number to every page?

Reply
Adarsh M Jatti says:

September 11, 2022 at 6:06 pm

I copied this code to watermark a pdf file, I followed all the steps that u mentioned, but when I do it and create the output file, the text in the input file is not visible and only watermark is visible, but when I extract the content of the output file all the text and watermark content is printed in terminal.Can u please guide me with this.

Reply
Shaibu says:

June 8, 2023 at 6:14 pm

Hi, what is if want the watermark to duplicate on the whole page.

Reply

How to Add Watermark to a PDF File Using Python

Add Watermark to a PDF file in Python

Python program to add watermark to pdf

17 responses to “How to Add Watermark to a PDF File Using Python”

Leave a Reply Cancel reply

Related Posts