xg.glass Project Team

Rokid Glasses + GPT-5.2 in a Real-World Test: How AI Glasses Will Impact the Future of Education

Jiayang Xu, Yufan Zhuang, Zijie Li, Jun Zhang, Zili Meng

GitHubhkust-spark/xg-glass-release
Image alt text

During a stressful final exam week, an inconspicuous "AI glasses" — smart AI glasses developed based on Rokid Glasses and equipped with OpenAI’s latest GPT-5.2 model released last week — achieved an astonishing score of 92.5 in the undergraduate final exam for Computer Communication Networks at the Hong Kong University of Science and Technology. This score successfully placed it among the top five in a class of over 100 students, significantly exceeding the student average of 72. Students could simply look down at the exam paper and quickly receive the answers.

This test, led by Professor Jun Zhang and Professor Zili Meng’s team at HKUST, not only verified the deep reasoning capabilities of AI glasses in complex knowledge tasks but also sounded an alarm to the global education community: the era of smart glasses for education has arrived. How to better use and regulate this technology will be crucial in the age of AI glasses.

Research Team's Test Results

The research team’s test results are shocking:

Test SubjectScore (out of 100)
Rokid Glasses (GPT 5.2)92.5
Student Highest Score97.5
Student Average Score72

AI Glasses Answering Workflow:

  • Image Capture: The student looks down at the exam paper, and the AI glasses' camera (such as the Rokid Glasses' 12-megapixel camera) quickly takes a photo.
  • Remote Inference: The exam photo is transmitted to the remote LLM via the glasses → phone → cloud pipeline.
  • Result Return: The answer is inferred by remote LLM, delivered via the reverse route (cloud → phone → glasses), and finally displayed on the glasses for the student to transcribe.
    Image alt text

Platform Selection

To achieve automatic answering with AI glasses, the glasses must satisfy both camera input and display output. A display is preferred over a speaker as it allows the answer to remain visible for a longer time, facilitating transcription.

We evaluated 12 mainstream commercial glasses:

CompanyBrand ModelCameraDisplay
MetaRay-Ban Gen2
MetaRay-Ban Display
Xingji MeizuStarV Air2
XiaomiXiaomi AI Glasses
Based HardwareOmi
MentraEven Realities G1
Even RealitiesEven Realities Vuzix 100
Gyges LabsHalliday
SolosSolos AirGo 3
OPPOAir Glass 3
Brilliant LabsFrame
RokidRokid Glasses

Only the Meta Ray-Ban, Frame, and Rokid smart glasses offer both an onboard camera and an integrated display. In addition, the glasses must provide an SDK for follow-on development. Although Meta provides a Device Access Toolkit, it does not expose an SDK for controlling the display. Therefore, only Frame and Rokid satisfy all three requirements. Because Frame’s camera image quality was insufficient, we ultimately selected Rokid Glasses as the test platform.

Large Model Selection

We evaluated mainstream large models. Although accuracy was similar, GPT-5.2 was chosen because it was significantly more responsive. GPT-5.2 is a powerful professional knowledge model newly released by OpenAI on December 11th of this year.

Here is an example video showing GPT-5.2’s faster response time than Gemini 3 Pro on Poe. GPT-5.2 produces the answer quickly, whereas Gemini 3 Pro takes noticeably longer to respond.

Detailed Performance Evaluation

Question TypeScoreFull Score
Multiple Choice2929
Single-Page SAQ1818
Multi-Page SAQ45.553
Total Score92.5100

Rokid scored full marks on both multiple-choice questions and single-page short-answer questions. Even on cross-page short-answer questions, it still earned most of the points. Cross-page SAQs are especially tricky because a single main problem can be split across several pages -- meaning the AI glasses can’t see the entire question at once. Sub-questions on later pages may depend on key information from earlier pages, which makes the whole workflow much harder. Despite that, Rokid handled these cases with ease.

Overall, the questions Rokid missed were usually the toughest sub-parts within a larger problem. They required non-trivial inferential calculation and put serious pressure on the model’s reasoning ability. And even when the final answer was wrong, the model still produced a substantial portion of the underlying reasoning steps.

Examples of correct answers:

Image alt text
Image alt text

Examples of wrong answers:

Image alt text

In the incorrect example above (caused by a knowledge discrepancy), the AI’s answer (111.123.15.254) was technically correct but did not match the simplified model used in the course exam (111.123.15.255). his mismatch occurred because the AI applied knowledge beyond what was covered in class.

Existing Issues and Challenges

Although the Rokid platform proved the software's feasibility, current commercial hardware still has the following shortcomings:

  • Image Quality Bottleneck: Camera clarity is the decisive factor for answering performance, as it directly determines whether AI can clearly read the question.
  • Power Consumption and Connection Stability: These factors are also critical.
    • Rokid: Enabling Wi-Fi for high-resolution photo transmission significantly increases power consumption. The battery level dropped from 100% to 58% after 30 minutes of answering.
    • Frame: Bluetooth connection stability is poor, especially at low battery levels. It disconnected 4 times within 15 minutes during testing, requiring a manual reconnection each time.
Image alt text

Impact on Education and Regulation

The rise of AI glasses in education is a double-edged sword.

  • Positive Impacts: Potential for educational technology innovation, such as personalized learning, providing tailored assistance and resources, reducing learning resource inequality, and helping students needing extra support improve academic performance.
  • Negative Impacts: Severe damage to academic integrity when used during exams, calling into question the fairness of the entire educational assessment system.

Educators and policymakers must act immediately to establish reasonable regulatory mechanisms to ensure technology is used for good:

  • Formulate Clear Policies and Regulations: Establish a legal framework that clearly defines the boundaries for smart device use in educational and examination settings. At the same time, enforce mandatory requirements for glasses manufacturers to indicate the presence of a camera through lights or other prompts.
  • Technical Monitoring and Detection: Employ signal jamming or specific monitoring software during exams to detect unauthorized use of smart devices.
  • Education and Publicity: Advocate for the legal and effective use of smart devices as aids during the learning process, but strictly prohibit their use during exams.

Only by finding a balance between ethics and technology, through comprehensive regulation at the legal, technical, educational, and cultural levels, can academic integrity be protected while truly realizing innovation and breakthroughs in educational technology.