Python how to grouping/merge rows in to single row

88arvin · Jan 11, 2023

Greetings!

Actually, I converted the PDF file containing the tables into a Pandas dataframe and then into Excel. Some cells in a PDF document contain multiline text.
I've previously converted PDFs into a Pandas dataframe and then into Excel, but in those PDFs, the cells with multiline text had a \n at the end of the line, so I managed to make the multiline text into a single line/cell, but in this PDF there is no \n.

So I want the text into one line/cell, but I am not able to do so. Can anybody please help me with the same?

I hope I am able to make you understand my question.

I am also attaching pictures of what I have in my Pandas dataframe and what I want for your reference.

This is what I getting after exporting dataframe into excel

And this is I want

Thanks in advance

Deleted member 2829 · Jan 14, 2023

So there are 3 stages of data here " PDF, Pandas data frame, and Excel sheet. If I understand correctly, the problem is already apparent in your Pandas data frame, and if you were to correct that, the Excel sheet would also be fine ? Please confirm, or else explain more.
So exactly how do you convert a PDF to a Pandas data frame ? Can we see the PDF and the code for that ?

simong1993 · Jan 14, 2023

Hey Mate, upload your code and something for us to work with and we can help you more

88arvin · Jan 18, 2023

cbreemer said:
So there are 3 stages of data here " PDF, Pandas data frame, and Excel sheet. If I understand correctly, the problem is already apparent in your Pandas data frame, and if you were to correct that, the Excel sheet would also be fine ? Please confirm, or else explain more.
So exactly how do you convert a PDF to a Pandas data frame ? Can we see the PDF and the code for that ?

I converted the PDF into Pandas using Tabula. I'm sorry, but because it's a bank statement(confidential), I can't give you guys access to the PDF.

88arvin · Jan 18, 2023

Can you please tell me how to add a new column before the PostDate column and enter a serial number where the PostDate column contains a value, and keep the space empty where the PostColumn contains nothing.

Code:

   NewColumn                PostDate
      1                    01-04-2012
      2                    03-04-2012
      3                    05-04-2012



      4                    10-04-2012

Welcome!

Python how to grouping/merge rows in to single row

88arvin

New Coder

Deleted member 2829

Guest

simong1993

Gold Coder

88arvin

New Coder

88arvin

New Coder

New Threads

Latest posts

Share this page

Buy us a coffee!

About Us

Site links

We value your privacy

Welcome!

Python how to grouping/merge rows in to single row

88arvin

New Coder

Deleted member 2829

Guest

simong1993

Gold Coder

88arvin

New Coder

88arvin

New Coder

Log in

New Threads

Latest posts

Share this page

Buy us a coffee!

About Us

Site links

Stay Connected

We value your privacy