Show HN: FlashQwen – A from-scratch CUDA inference engine for Qwen3

langtang19961 pts0 comments

frankkk96 (Frank) · GitHub

" data-turbo-transient="true" />

Skip to content

Search or jump to...

Search code, repositories, users, issues, pull requests...

-->

Search

Clear

Search syntax tips

Provide feedback

--><br>We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Cancel

Submit feedback

Saved searches

Use saved searches to filter your results more quickly

-->

Name

Query

To see all available qualifiers, see our documentation.

Cancel

Create saved search

Sign in

;ref_cta:Sign up;ref_loc:header logged out"}"<br>Sign up

Appearance settings

Resetting focus

You signed in with another tab or window. Reload to refresh your session.<br>You signed out in another tab or window. Reload to refresh your session.<br>You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

{{ message }}

frankkk96

Follow

More

Overview

Repositories

Projects

Packages

Stars

frankkk96

Follow

🏠

Working from home

Frank

frankkk96

🏠

Working from home

Follow

ML Engineer, prev @microsoft, @tencentgames. WeChat: Frankkk96

followers<br>&middot;<br>following

https://frankk.site

@frank_uid

Achievements

Achievements

Highlights

Pro

Block or report user

Block or report frankkk96

-->

Block user

Prevent this user from interacting with your repositories and sending you notifications.<br>Learn more about blocking users.

You must be logged in to block users.

Add an optional note

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.

Block user

Report abuse

Contact GitHub support about this user’s behavior.<br>Learn more about reporting abuse.

Report abuse

More

Overview

Repositories

Projects

Packages

Stars

Pinned

Loading

FlashQwen FlashQwen Public

From-scratch C++/CUDA inference engine for Qwen3-8B

Go

54

Something went wrong, please refresh the page to try again.

If the problem persists, check the GitHub status page<br>or contact support.

Uh oh!

There was an error while loading. Please reload this page.

You can’t perform that action at this time.

frankkk96 from search block user repositories

Related Articles