This is a package for converting pcap files primarily to parquet format. CSV and JSON formats are also supported.
To install the package, run the following command:
pip install pcaptoparquetOr with a virtual environment:
python -m venv <your_venv>
source <your_venv>/bin/activate # In windows: .\<your_venv>\Scripts\activate
python -m pip install --upgrade pip
python -m pip install pcaptoparquetTo see all available options:
pcaptoparquet -hNote 1: Portable executable can be created with pyinstaller but this executable has not been fully tested. Check the Makefile for more information.
Note 2: CLI interface was not fully tested in Windows environment. Unit testing is only instrumented for Linux.
The pcaptoparquet package provides the E2EPcap class for converting pcap files to different formats. Here's how you can use it:
- Import the
E2EPcapclass from thepcaptoparquetpackage:
from pcaptoparquet import E2EPcap- Create an instance of
E2EPcapwith the path to your pcap file:
pcap = E2EPcap('path_to_your_pcap_file')Replace 'path_to_your_pcap_file' with the actual path to your pcap file.
- Use the
exportmethod of theE2EPcapinstance to convert the pcap data to a different format:
pcap.export(format='parquet', output='output_directory')The format parameter specifies the output format. In this example, we're converting the pcap data to parquet format.
The output parameter specifies the directory where the output file will be saved. Replace 'output_directory' with the actual path to your output directory.
This is a basic example of how to use the pcaptoparquet package. Depending on your needs, you might need to use additional methods or parameters.
Refer to pcaptoparquet_cli.py for a more complex example of use. In particular, refer to extensibility options such as application protocol implementations and post-processing callbacks associated to E2EConfig class. Full examples included in tests folder (config and callbacks subfolders).
Contributions are welcome. Please follow these steps to contribute:
- Fork the repository.
- Create a new branch (
git checkout -b feature-branch). - Make your changes.
- Run the tests and code quality checks to ensure everything works correctly.
- Commit your changes (
git commit -am 'Add new feature'). - Push to the branch (
git push origin feature-branch). - Create a new Pull Request.
Please make sure to update tests as appropriate.
After cloning the repository, you can set up your development environment:
python -m venv <your_venv>
source <your_venv>/bin/activate # In windows: .\<your_venv>\Scripts\activate
python -m pip install --upgrade pip
python -m pip install -e .
python -m pip install black isort pyright flake8 Flake8-pyproject mypy tox coverage build twineor
make venvThe pcaptoparquet package includes a suite of tests to ensure its functionality. These tests are located in the tests directory.
To run the tests, you'll need tox, which is a tool for automating testing in multiple Python environments.
Here's how you can run the tests:
toxor
make testThis command will run all the tests in the tests directory and display the results in the terminal.
If you make changes to the pcaptoparquet code, please make sure to run the tests and ensure they all pass before submitting a pull request.
Coverage is also available but only informational for now:
make coverageThe pcaptoparquet project uses several tools to ensure code quality:
black: for code formattingisort: for sorting importspyright: for type checkingflake8: for lintingmypy: for static type checking
You can run these checks using the following commands:
black --check pcaptoparquet_cli.py pcaptoparquet tests
isort --check-only pcaptoparquet_cli.py pcaptoparquet tests
pyright pcaptoparquet_cli.py pcaptoparquet tests
flake8 pcaptoparquet_cli.py pcaptoparquet tests
mypy pcaptoparquet_cli.py pcaptoparquet testsor
make checkTo automatically fix formatting issues:
make fixThis project is licensed under the BSD-3-Clause License. See the LICENSE file for more details. Copyright 2025 Nokia.
Additional documentation can be found at pcaptoparquet.