This is a Python package that processes company names, providing cleaned versions of the names by stripping away terms indicating organization type (such as "Ltd." or "Corp").
Using a database of organization type terms, It also provides an utility to deduce the type of organization, in terms of US/UK business entity types (ie. "limited liability company" or "non-profit").
Finally, the system uses the term information to suggest countries the organization could be established in. For example, the term "Oy" in company name suggests it is established in Finland, whereas "Ltd" in company name could mean UK, US or a number of other countries.
Just use 'pip install cleanco' if you have pip installed (as most systems do). Or download the zip distribution from this site, unzip it and then:
cdinto it, and enter
sudo python setup.py installalong with your system password.
Let's look at some sample code. To get the base name of a business without legal suffix:
from cleanco import prepare_terms, basename business_name = "Some Big Pharma, LLC" terms = prepare_terms() basename(business_name, terms, prefix=False, middle=False, suffix=True) 'Some Big Pharma'
Note that sometimes a name may have e.g. two different suffixes after one another. The cleanco
term data covers many of these but you may want to run
basename() twice, just in case.
To get the business type or country:
from cleanco import typesources, matches classification_sources = typesources() matches("Some Big Pharma, LLC", classification_sources) ['Limited Liability Company']
To get the possible countries of jurisdiction:
from cleanco import countrysources, matches classification_sources = countrysources() matches("Some Big Pharma, LLC", classification_sources) ´ ['United States of America', 'Philippines']
The legacy (versions < 2.0) way can still be used, too, but will eventually be discontinued:
Import the utility class:
from cleanco import cleanco
Prepare a string of a company name that you want to process:
>>> business_name = "Some Big Pharma, LLC"
Throw it into the instance:
>>> x = cleanco(business_name)
You can now get the company types:
>>> x.type() ['Limited Liability Company']
...the possible countries...
'United States of America', 'Philippines']x.country() [
...and a clean version of the company name.
'Some Big Pharma'x.clean_name()
See the issue tracker. If you find a bug or have enhancement suggestion or question, please file an issue and provide a PR if you can. For example, some of the company suffixes may be incorrect or there may be suffixes missing.
To run tests, simply install the package and run
python setup.py test. To run tests on multiple Python versions, install
tox and run it (see the provided tox.ini).