Add support for MySQL #1

Merged
muir merged 20 commits from mysql into main on Mar 30, 2022
Conversation

@muir (Owner) commented Jul 16, 2021

This also backfills SkipIf.

MySQL doesn't have DDL transactions, so it needs a lot of help making sure that you don't shoot yourself in the foot.

Should sqltoken be its own repo?

Should dgorder be its own repo?

Two review comments on lsmysql/README.md (outdated; resolved).
@muir (Owner, Author) commented Jul 16, 2021

Thanks, @aaronlehmann. I've fixed those typos.

@aaronlehmann (Collaborator) left a comment

A little concerned about the complexity of the hand-rolled tokenizer. Have you looked at any third-party lexer libraries like https://github.com/alecthomas/participle?

sqltoken/tokenize.go:

```go
	'n', 'o', 'p', 'q', 'r', 's', 't', 'u', 'v', 'w' /*x*/, 'y', 'z',
	'A' /*B*/, 'C', 'D', 'E', 'F', 'G', 'H', 'I', 'J', 'K', 'L', 'M',
	'N', 'O', 'P', 'Q', 'R', 'S', 'T' /*U*/, 'V', 'W' /*X*/, 'Y', 'Z',
	'_':
```
@aaronlehmann (Collaborator):

This seems confusing and hard to maintain. At least there should be a comment explaining these values. It may make sense to handle these as ranges outside the switch.

@muir (Owner, Author):

This is a performance hack: anything missed will be caught by the unicode code path below. See added comment.
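For context, the bytes commented out of the excerpt above (x, B, U, X) are presumably handled by earlier cases because they can introduce special literal forms (hex, bit, and Unicode string literals). Below is a minimal standalone sketch of the fast-path pattern under discussion, with a hypothetical isIdentStart helper; it is an illustration of the technique, not the actual sqltoken code:

```go
package main

import (
	"fmt"
	"unicode"
)

// isIdentStart reports whether r can begin an identifier. The case
// list is a fast path for common ASCII bytes; omitting a byte from it
// is harmless because the unicode fallback below still catches it.
func isIdentStart(r rune) bool {
	switch r {
	case 'a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j', 'k', 'l', 'm',
		'n', 'o', 'p', 'q', 'r', 's', 't', 'u', 'v', 'w', 'x', 'y', 'z',
		'A', 'B', 'C', 'D', 'E', 'F', 'G', 'H', 'I', 'J', 'K', 'L', 'M',
		'N', 'O', 'P', 'Q', 'R', 'S', 'T', 'U', 'V', 'W', 'X', 'Y', 'Z',
		'_':
		return true // fast path: plain ASCII identifier characters
	}
	// Slow path: anything the switch missed, including non-ASCII letters.
	return unicode.IsLetter(r)
}

func main() {
	for _, r := range []rune{'a', '_', 'é', '7'} {
		fmt.Printf("%c: %v\n", r, isIdentStart(r))
	}
}
```

The benefit of the enumerated cases is that the common ASCII path never has to consult the unicode tables.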

sqltoken/tokenize.go:

```go
	'A', 'B', 'C', 'D', 'E', 'F', 'G', 'H', 'I', 'J', 'K', 'L', 'M',
	'N', 'O', 'P', 'Q', 'R', 'S', 'T', 'U', 'V', 'W', 'X', 'Y', 'Z',
	'_',
	'0', '1', '2', '3', '4', '5', '6', '7', '8', '9':
```
@aaronlehmann (Collaborator):

I think this would be cleaner with range expressions inside a general `switch {`.
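For illustration, a hypothetical sketch of the range-based rewrite being suggested (a drop-in companion to the isIdentStart sketch above, reusing its unicode import; not code from the PR):

```go
// isIdentStartRanges expresses the same classification with range
// comparisons in a general switch instead of enumerating every byte.
func isIdentStartRanges(r rune) bool {
	switch {
	case r >= 'a' && r <= 'z',
		r >= 'A' && r <= 'Z',
		r == '_':
		return true
	}
	return unicode.IsLetter(r)
}
```

Whether this compiles to something slower than the enumerated case list is exactly what the exchange below is about.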

@muir (Owner, Author):

I've added a comment: this is a performance hack. Would a range expression perform better?

@aaronlehmann (Collaborator):

I would think the performance difference with a range expression would be imperceptible in the context of running database migrations.

@muir (Owner, Author):

I'm planning to use this code to fix sqlx: it would be run on nearly every (or maybe every?) query done by sqlx. Performance matters in that context.

@muir (Owner, Author) commented Jul 19, 2021

> A little concerned about the complexity of the hand-rolled tokenizer. Have you looked at any third-party lexer libraries like https://github.com/alecthomas/participle?

I would be concerned too, except that I used the coverage tool and hit 100% coverage (the coverage tool is buggy, so it doesn't report 100%, but if you look at the details, it is 100%). Not all inputs tried, of course.

Part of the motivation for this is that I noticed that sqlx is incorrectly parsing SQL when doing substitutions: what they're doing is high-performance but wrong. My intent is to open a PR against sqlx to use my tokenizer. Due to the way I wrote my tokenizer, I expect its performance to be very good.
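To make the failure mode concrete, here is a hypothetical, self-contained demonstration of the bug class described above; the naiveRebind function is invented for this example and is not sqlx's actual code:

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// naiveRebind rewrites every '?' to $1, $2, ... with no awareness of
// SQL quoting, so a '?' inside a string literal gets rewritten too.
func naiveRebind(query string) string {
	var b strings.Builder
	n := 0
	for _, r := range query {
		if r == '?' {
			n++
			b.WriteString("$" + strconv.Itoa(n))
			continue
		}
		b.WriteRune(r)
	}
	return b.String()
}

func main() {
	q := "SELECT * FROM t WHERE note = 'why?' AND id = ?"
	fmt.Println(naiveRebind(q))
	// Prints: SELECT * FROM t WHERE note = 'why$1' AND id = $2
	// The first rewrite is wrong: that '?' is data inside a string
	// literal, which a quote-aware tokenizer would leave untouched.
}
```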

If I used a regular lexer, I would need to treat MySQL and PostgreSQL as separate grammars.

Tradeoffs galore!

@muir merged commit 83aa21d into main on Mar 30, 2022
@muir deleted the mysql branch on Mar 30, 2022