Missing Ticket Content from Mail #1063

brucegibbins · 2023-01-13T00:10:39Z

When a ticket is raised via email that has multiple parts with the body being html the content is not used for ticket Description or Comment.

I suspect that this may be somehow related to the email being an O365 mailbox.

I have troubleshot the code and can see where the multipart sections are parsed in the extract_part_data() function. In my case the content type is 'text/html'.

This portion of the code populates the part_full_body variable after parsing with beautifulsoup but it does not populate the part_body variable as it does if the message was pure text.

Because of this the calling function object_from_message() does not receive any value for the body variable. The end result is that the following conditions looking for body content all fail.

I am not sure if this is an RFC formatting issue. But I am working on trying to resolve unless someone has any ideas

uhurusurfa · 2023-01-30T15:59:26Z

The code will need to be upodated to handle the described situation.
If you are able to fix yourself and raise a PR please feel free otherise it will gt fixed in due cxourse as develoipers are able to devote the time to it.

brucegibbins · 2023-01-30T22:46:11Z

Thank you for your reply. I am considering the PR option. However I have made a number of changes including the ability to use Oauth for mail authenticaton. Therefore, there a number of "changes' spread across a number of modules. What would be the best way to manage this in the context of a PR? Thanks in advance.

…

On Tue, 31 Jan 2023, 1:59 am Christopher Broderick, < ***@***.***> wrote: The code will need to be upodated to handle the described situation. If you are able to fix yourself and raise a PR please feel free otherise it will gt fixed in due cxourse as develoipers are able to devote the time to it. — Reply to this email directly, view it on GitHub <#1063 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AEAXOGBXPXMINKJILKUZHCLWU7QOTANCNFSM6AAAAAATZ2MJ2M> . You are receiving this because you authored the thread.Message ID: ***@***.***>

uhurusurfa · 2023-03-24T01:00:56Z

Do you have an example email where this is happening?
It sounds like it is an amail that does not have a text/plain part to it and would make it quicker to create a unit test for this if you can procide the sample that is breaking it..

brucegibbins · 2023-03-24T09:04:11Z

Hi. I actually think now that the small change I made here has led to the other issue I posted regarding the extra line feed after the body was extracted. Let me not waste too much of your time and get quicktests working with my OAUTH mod and then i can relook at this original post. Kind Regards

brucegibbins · 2023-03-30T05:58:30Z

I am working on a PR for this.

If the mail body is html which in our case, it is because most staff use Outlook.

I have discovered that the after determining that the mail part subtype is 'html' the variable full_part_body is initialised but the variable part_body never carries a value when this condition is true. Then in the calling function object_from_message() the variable body never carries a value and then this is used to populate the description field downstream. Therefore, the description remains empty. If there are multiple mail parts and one of these happens to be text, then the part_body variable does get a value, but it may not be the one expected.

Something needs to happen near line 778 in in email.py unless I am missing something.

uhurusurfa · 2023-03-30T15:32:13Z

Looking at the code history, it seems there was an effort to deal with this issue over time and appears to have resulted in a fairly convoluted bit of code.

It is probably worth defining exactly what the extract process should be then refactor the code to do that.

The first commit more than 4 years ago iterated the body parts if it was a multipart looking for a text/plain part and used that as the message body.
The fallback was to search for a body element by converting the entire mime message to a string which does not really make sense from my point of view as the outcome is likely to be hit and miss.

Modern emails generally are sent with a multipart that encapsulates both a text/plain and a text/html part by default unless the email client is configured to do differently. Some disable the text/plain part so the body has to come from the text/html part.

The problem gets complicated when attachments and inline content are added resulting in a multipart within a multipart. And then attached emails can themselves have a similar structure resulting in a multidepth message.

So small wonder you are seeing unreliable processing of emails and really the algorithm needs to be rewritten as a recursive decent parser if the email is to be processed reliably catering for all the variations that may be possible depnding on the email client that assempled the email.

brucegibbins · 2023-03-31T03:31:33Z

Yep. I put some changes in on ourside and was getting better results. But tests on main side failed. For me as I need a solution in relatively quick time but would like to see contribution back may I suggest the following as a small step forward that can be ignored by others who may get source from repo or pypi. * introduce new queue email method type. Lets call it for what it is O365 * it uses generic oauth2 library but need to store client id and access token guff so maybe that goes in external file or settings. It could go into table but then need migrations and security * then i put new pickup code in a new email module or it can go into existing email.py. either way it becomes avaialble for cron job or in my case schedule task * seperate test written The thing is that I can get 100 accuracy for me by using an existing o365 module that handles all of the authentication and all of the o365 mail fluff with attachments etc. Whilst this may be a pip install dependency so is pinax. So really a concern in my opinon and maybe if configuration driven can be done in a non break fashion I am sure you have a much broader view of none breaking and so i have probably offended you. But in some cases and in my opinion sometimes better to break and fix. I think if the project did have reliable o365 integration there may be more uptake and more contriution.

…

On Fri, 31 Mar 2023, 1:32 am Christopher Broderick, < ***@***.***> wrote: Looking at the code history, it seems there was an effort to deal with this issue over time and appears to have resulted in a fairly convoluted bit of code. It is probably worth defining exactly what the extract process should be then refactor the code to do that. The first commit more than 4 years ago iterated the body parts if it was a multipart looking for a text/plain part and used that as the message body. The fallback was to search for a body element by converting the entire mime message to a string which does not really make sense from my point of view as the outcome is likely to be hit and miss. Modern emails generally are sent with a multipart that encapsulates both a text/plain and a text/html part by default unless the email client is configured to do differently. Some disable the text/plain part so the body has to come from the text/html part. The problem gets complicated when attachments and inline content are added resulting in a multipart within a multipart. And then attached emails can themselves have a similar structure resulting in a multidepth message. So small wonder you are seeing unreliable processing of emails and really the algorithm needs to be rewritten as a recursive decent parser if the email is to be processed reliably catering for all the variations that may be possible depnding on the email client that assempled the email. — Reply to this email directly, view it on GitHub <#1063 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AEAXOGCKXOLMCPAHAS63MVLW6WRQRANCNFSM6AAAAAATZ2MJ2M> . You are receiving this because you authored the thread.Message ID: ***@***.***>

uhurusurfa · 2023-07-23T06:35:49Z

This PR #1104 should fix this issue - please confirm and close if so.

brucegibbins added the bug Something isn't working or is broken label Jan 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Missing Ticket Content from Mail #1063

Missing Ticket Content from Mail #1063

brucegibbins commented Jan 13, 2023

uhurusurfa commented Jan 30, 2023

brucegibbins commented Jan 30, 2023 via email

uhurusurfa commented Mar 24, 2023

brucegibbins commented Mar 24, 2023 •

edited

brucegibbins commented Mar 30, 2023

uhurusurfa commented Mar 30, 2023

brucegibbins commented Mar 31, 2023 via email

uhurusurfa commented Jul 23, 2023 •

edited

Missing Ticket Content from Mail #1063

Missing Ticket Content from Mail #1063

Comments

brucegibbins commented Jan 13, 2023

uhurusurfa commented Jan 30, 2023

brucegibbins commented Jan 30, 2023 via email

uhurusurfa commented Mar 24, 2023

brucegibbins commented Mar 24, 2023 • edited

brucegibbins commented Mar 30, 2023

uhurusurfa commented Mar 30, 2023

brucegibbins commented Mar 31, 2023 via email

uhurusurfa commented Jul 23, 2023 • edited

brucegibbins commented Mar 24, 2023 •

edited

uhurusurfa commented Jul 23, 2023 •

edited