Welcome to msg parser’s documentation!

msg_parser

https://img.shields.io/pypi/v/msg_parser.svg https://img.shields.io/travis/vikramarsid/msg_parser.svg Documentation Status Updates

Python module for parsing outlook msg files.

Features

  • Parse MSG file.
  • Convert MSG file to EML file.
  • Output MSG file as JSON string.
  • Handles nested MSG/EML attachments.
  • Works 100% on Linux machines, do not require any windows libraries.

Installation

  • Basic installation

    pip install msg_parser
    
  • With RTF decompression

    pip install msg_parser[rtf]
    

Usage

  • Run CLI command

    $ msg_parser --help
      usage: msg_parser [-h] -i FILE [-j] [-e EML_FILE]
    
     Microsoft Message Parser
    
     optional arguments:
         -h, --help            show this help message and exit
         -i FILE, --input FILE
                               msg file path
         -j, --json            output parsed msg as json to console
         -e EML_FILE, --eml EML_FILE
                               provide email file path to save as eml file.
    
  • Import in python modules

    from msg_parser import MsOxMessage
    
    msg_obj = MsOxMessage(msg_file_path)
    
    json_string = msg_obj.get_message_as_json()
    
    msg_properties_dict = msg_obj.get_properties()
    
    saved_path = msg_obj.save_email_file(output_eml_file_path)
    

Installation

Stable release

To install msg_parser, run this command in your terminal:

$ pip install msg_parser

This is the preferred method to install msg_parser, as it will always install the most recent stable release.

If you don’t have pip installed, this Python installation guide can guide you through the process.

From sources

The sources for msg_parser can be downloaded from the Github repo.

You can either clone the public repository:

$ git clone git://github.com/vikramarsid/msg_parser

Or download the tarball:

$ curl  -OL https://github.com/vikramarsid/msg_parser/tarball/master

Once you have a copy of the source, you can install it with:

$ python setup.py install

Usage

To use msg_parser in a project:

from msg_parser import MsOxMessage

msg_obj = MsOxMessage(msg_file_path)

json_string = msg_obj.get_message_as_json()

msg_properties_dict = msg_obj.get_properties()

saved_path = msg_obj.save_email_file(output_eml_file_path)

msg_parser

msg_parser package

Subpackages

msg_parser.properties package
Submodules
msg_parser.properties.ms_props_date_type_map module
msg_parser.properties.ms_props_generator module
msg_parser.properties.ms_props_generator.generate_data_id_type_mapping(master_map)[source]
msg_parser.properties.ms_props_generator.generate_id_name_mapping(master_map)[source]
msg_parser.properties.ms_props_generator.generate_master_properties()[source]
msg_parser.properties.ms_props_id_map module
Module contents

Submodules

msg_parser.cli module

Console script for msg_parser.

class msg_parser.cli.FullPaths(option_strings, dest, nargs=None, const=None, default=None, type=None, choices=None, required=False, help=None, metavar=None)[source]

Bases: argparse.Action

Expand user- and relative-paths

msg_parser.cli.create_parser(args)[source]
msg_parser.cli.is_dir(dir_name)[source]

Checks if a path is an actual directory

msg_parser.cli.main()[source]

msg_parser.data_models module

class msg_parser.data_models.DataModel[source]

Bases: object

static PtypBinary(data_value)[source]
static PtypBoolean(data_value)[source]
static PtypCurrency(data_value)[source]
static PtypErrorCode(data_value)[source]
static PtypFloating32(data_value)[source]
static PtypFloating64(data_value)[source]
static PtypFloatingTime(data_value)[source]
static PtypGuid(data_value)[source]
static PtypInteger16(data_value)[source]
static PtypInteger32(data_value)[source]
static PtypInteger64(data_value)[source]
static PtypMultipleBinary(data_value)[source]
static PtypMultipleCurrency(data_value)[source]
static PtypMultipleFloating32(data_value)[source]
static PtypMultipleFloating64(data_value)[source]
static PtypMultipleFloatingTime(data_value)[source]
static PtypMultipleGuid(data_value)[source]
static PtypMultipleInteger16(data_value)[source]
static PtypMultipleInteger32(data_value)[source]
static PtypMultipleInteger64(data_value)[source]
static PtypMultipleString(data_value)[source]
static PtypMultipleString8(data_value)[source]
static PtypMultipleTime(data_value)[source]
static PtypNull(_)[source]
static PtypObject(data_value)[source]
static PtypRestriction(data_value)[source]
static PtypRuleAction(data_value)[source]
static PtypServerId(data_value)[source]
static PtypString(data_value)[source]
static PtypString8(data_value)[source]
static PtypTime(data_value)[source]
static PtypUnspecified(data_value)[source]
get_value(data_value, data_type_name=None, data_type=None)[source]
static lookup_data_type_name(data_type)[source]
msg_parser.data_models.get_floating_time(data_value)[source]
msg_parser.data_models.get_multi_value_offsets(data_value)[source]
msg_parser.data_models.get_time(data_value)[source]

msg_parser.email_builder module

class msg_parser.email_builder.EmailFormatter(msg_object)[source]

Bases: object

build_email()[source]
save_file(file_path, file_name=None)[source]
msg_parser.email_builder.flatten_list(string_list)[source]
msg_parser.email_builder.normalize(input_str)[source]

msg_parser.msg_parser module

class msg_parser.msg_parser.Attachment(attachment_properties)[source]

Bases: object

class to store attachment attributes

class msg_parser.msg_parser.Message(directory_entries)[source]

Bases: object

Class to store Message properties

as_dict()[source]

returns message attributes as a python dictionary. :return: dict

class msg_parser.msg_parser.MsOxMessage(msg_file_path)[source]

Bases: object

Base class for Microsoft Message Object

get_email_mime_content()[source]
get_message_as_json()[source]
get_properties()[source]
get_properties_as_dict()[source]
is_valid_msg_file()[source]
save_email_file(file_path, file_name=None)[source]
class msg_parser.msg_parser.Recipient(recipients_properties)[source]

Bases: object

class to store recipient attributes

msg_parser.msg_parser.format_size(num, suffix='B')[source]
msg_parser.msg_parser.parse_email_headers(header, raw=False)[source]

Module contents

Top-level package for msg_parser.

Contributing

Contributions are welcome, and they are greatly appreciated! Every little bit helps, and credit will always be given.

You can contribute in many ways:

Types of Contributions

Report Bugs

Report bugs at https://github.com/vikramarsid/msg_parser/issues.

If you are reporting a bug, please include:

  • Your operating system name and version.
  • Any details about your local setup that might be helpful in troubleshooting.
  • Detailed steps to reproduce the bug.

Fix Bugs

Look through the GitHub issues for bugs. Anything tagged with “bug” and “help wanted” is open to whoever wants to implement it.

Implement Features

Look through the GitHub issues for features. Anything tagged with “enhancement” and “help wanted” is open to whoever wants to implement it.

Write Documentation

msg_parser could always use more documentation, whether as part of the official msg_parser docs, in docstrings, or even on the web in blog posts, articles, and such.

Submit Feedback

The best way to send feedback is to file an issue at https://github.com/vikramarsid/msg_parser/issues.

If you are proposing a feature:

  • Explain in detail how it would work.
  • Keep the scope as narrow as possible, to make it easier to implement.
  • Remember that this is a volunteer-driven project, and that contributions are welcome :)

Get Started!

Ready to contribute? Here’s how to set up msg_parser for local development.

  1. Fork the msg_parser repo on GitHub.

  2. Clone your fork locally:

    $ git clone git@github.com:your_name_here/msg_parser.git
    
  3. Install your local copy into a virtualenv. Assuming you have virtualenvwrapper installed, this is how you set up your fork for local development:

    $ mkvirtualenv msg_parser
    $ cd msg_parser/
    $ python setup.py develop
    
  4. Create a branch for local development:

    $ git checkout -b name-of-your-bugfix-or-feature
    

    Now you can make your changes locally.

  5. When you’re done making changes, check that your changes pass flake8 and the tests, including testing other Python versions with tox:

    $ flake8 msg_parser tests
    $ python setup.py test or py.test
    $ tox
    

    To get flake8 and tox, just pip install them into your virtualenv.

  6. Commit your changes and push your branch to GitHub:

    $ git add .
    $ git commit -m "Your detailed description of your changes."
    $ git push origin name-of-your-bugfix-or-feature
    
  7. Submit a pull request through the GitHub website.

Pull Request Guidelines

Before you submit a pull request, check that it meets these guidelines:

  1. The pull request should include tests.
  2. If the pull request adds functionality, the docs should be updated. Put your new functionality into a function with a docstring, and add the feature to the list in README.rst.
  3. The pull request should work for Python 2.7, 3.4, 3.5 and 3.6, and for PyPy. Check https://travis-ci.org/vikramarsid/msg_parser/pull_requests and make sure that the tests pass for all supported Python versions.

Tips

To run a subset of tests:

$ py.test tests.test_msg_parser

Deploying

A reminder for the maintainers on how to deploy. Make sure all your changes are committed (including an entry in HISTORY.rst). Then run:

$ bumpversion patch # possible: major / minor / patch
$ git push
$ git push --tags

Travis will then deploy to PyPI if tests pass.

Credits

Development Lead

Contributors

None yet. Why not be the first?

History

0.1.0 (2018-10-26)

  • Initial commit.

1.0.0 (2018-10-29)

  • Initial pypi release.
  • Added headers_dict to message object.

1.2.0 (2019-12-12)

  • Python 3.x support
  • Updates to Makefile
  • Formatting and linting

Indices and tables