consider making html5lib.tokenizer public

Hello,

In version https://github.com/html5lib/html5lib-python/releases/tag/0.999999999 , `html5lib.tokenizer` was made private

The `wpull` project (https://github.com/ArchiveTeam/wpull ) uses this library, and if we were to ever migrate to using the 1.X versions, it would negatively impact the application, because instead of just tokenizing a webpage (see https://github.com/ArchiveTeam/wpull/blob/a4ff4a93f613ce18ad3c515aa3d4f5848a88b98c/wpull/document/htmlparse/html5lib_.py ), we would have to use the full tree parsing which is slower and uses more ram

is there any reason this was made private when the 1.x branch was released?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

consider making html5lib.tokenizer public #532

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

consider making html5lib.tokenizer public #532

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions