Distance Weighted Discrimination for Python
Project description
Overview
This package implements Distance Weighted Discrimination (DWD). DWD For details see (Marron et al 2007, Wang and Zou 2018). Originally implemented in Python by Iain Carmichael. Currently maintained by Kitware, Inc.
The package currently implements:
-
Original DWD formulation solved with Second Order Cone Programming (SOCP) and solved using cvxpy.
-
Genralized DWD (gDWD) and kernel gDWD solved with the Majorization-Minimization algorithm presented in Wang and Zou, 2018.
Marron, James Stephen, Michael J. Todd, and Jeongyoun Ahn. "Distance-weighted discrimination." Journal of the American Statistical Association 102, no. 480 (2007): 1267-1271.
Wang, Boxiang, and Hui Zou. "Another look at distance‐weighted discrimination." Journal of the Royal Statistical Society: Series B (Statistical Methodology) 80, no. 1 (2018): 177-198.
Installation
The DWD package can be installed via pip or github. This package is currently only tested in python 3.6.
$ pip install dwd
The conic solver socp_dwd.DWD
depends on cvxpy
, which is not available on all platforms. See the cvxpy
installation instructions. If cvxpy
dependencies are met, then use pip install dwd[socp]
.
Flit is used for packaging, and all package metadata is stored in pyproject.toml
. To install this project locally or for development, use flit install
or build a pip-installable wheel with flit build
.
Example
from sklearn.datasets import make_blobs
from dwd.socp_dwd import DWD
# sample sythetic training data
X, y = make_blobs(
n_samples=200,
n_features=2,
centers=[[0, 0],
[2, 2]],
)
# fit DWD classifier
dwd = DWD(C='auto').fit(X, y)
# compute training accuracy
dwd.score(X, y) # 0.94
from sklearn.datasets import make_circles
from dwd.gen_kern_dwd import KernGDWD
# sample some non-linear, toy data
X, y = make_circles(n_samples=200, noise=0.2, factor=0.5, random_state=1)
# fit kernel DWD wit gaussian kernel
kdwd = KernGDWD(
lambd=.1, kernel='rbf',
kernel_kws={'gamma': 1},
).fit(X, y)
# compute training accuracy
kdwd.score(X, y) # 0.915
For more example code see these example notebooks (including the code to generate the above figures). If the notebooks aren't loading on github you can copy/paste the notebook url into https://nbviewer.jupyter.org/.
Help and Support
Additional documentation, examples and code revisions are coming soon.
Documentation
The source code is located on github: https://github.com/slicersalt/dwd
Testing
Testing is done using nose
.
Contributing
We welcome contributions to make this a stronger package: data examples, bug fixes, spelling errors, new features, etc.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dwd-1.0.5.tar.gz
.
File metadata
- Download URL: dwd-1.0.5.tar.gz
- Upload date:
- Size: 214.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: python-requests/2.25.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3b93790801dfc5f7002a74a1894bfc2229de3ccbe4a027bf800a99fbfcbc67aa |
|
MD5 | 58f852943ca35ac04af711e04f082380 |
|
BLAKE2b-256 | 4985274e5785a9f02f66195ce2fc025e925ec09685e10aa8ea90d5b51852336e |
File details
Details for the file dwd-1.0.5-py3-none-any.whl
.
File metadata
- Download URL: dwd-1.0.5-py3-none-any.whl
- Upload date:
- Size: 22.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: python-requests/2.25.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fcd48b77c390ff882a1750ef263971943c44b05071332d9d5d59b87cd1816ad8 |
|
MD5 | d6524373295e27cca5b98b9054ee6c24 |
|
BLAKE2b-256 | 81e92484e01d71982cc8ea7e7c69224e4b6ca872b562b2246642fd1537fe0756 |