手记

MLSQL v1.1.7 Release roadmap

MLSQL v1.1.7 plans to release in Mid Jan 2019, this version will take almost three weeks.

MLSQL v1.1.7 Release Window:

DateEvent
Late Dec 2018New features and Improvement
Early Jan 2019Code freeze. Release branch cut. QA period. Focus on bug fixes, tests, stability, and docs. Generally, no new features merged.
Mid Jan 2019Release candidates (RC), voting, etc. until final release passes

New Features in MLSQL v1.1.7

  1. Hive JDBC supports. In spark, if you use JDBC data source to connect hive thrift jdbc server, it will fail since there is no HiveJDBCDialert implemented. Please check PR-828

  2. MongoDB supports. More detail please check PR-822

  3. Solr supports

  4. Docker release of  MLSQL-Cluster

  5. Docker release of MLSQL

Improvement in MLSQL v1.1.7

  1. Refactor DataSource adaptor which is used in load/save statement. This improvement will make people involved in MLSQL community more easy to contribute DataSource implementation. People no need to worry about affecting the core code in MLSQL when adding new data source adaptor. More detail please check PR-815

  2. Docker introduced to unit test in MLSQL. MLSQL provides an abstract server class which you can easy to use Docker to create a data source. This makes you can test if MLSQL is working properly with DataSource e.g. Kafka(multi-version), MongoDB, MySQL, and more.

  3. DataSource direct query mode. As we know, there are standard DataSource API in spark, but sometimes it will trigger full scan on the original table which is slow and resource waste.
    For example, we use SQL like select count(*) from table1 in MySQL,  it will be quick if it executed in MySQL instead of spark. So we provide a direct query mode. JDBC and ElasticSearch should support this mode in v1.1.7.

  4. The group-id will be changed from streaming.king to tech.mlsql.



作者:祝威廉
链接:https://www.jianshu.com/p/622ff0c7beb1


0人推荐
随时随地看视频
慕课网APP