fix: correct Prompt serialization, Test.metadata alias, and multi-turn NPE by harry-rhesis · Pull Request #5 · rhesis-ai/rhesis-java

harry-rhesis · 2026-04-20T16:52:59Z

Summary

Fixes #3 and two additional bugs discovered during the audit.

Prompt.expectedResponse / languageCode now serialize at the top level — previously they were nested under prompt.metadata, causing the backend to silently drop them. Generated tests had no expected response on the platform.
Prompt.role removed — was never part of the backend API or Python SDK.
Test.metadata deserializes from the backend's test_metadata key — the backend renames metadata → test_metadata in GET responses to avoid a SQLAlchemy naming conflict. @JsonAlias("test_metadata") is now added so test.metadata() is no longer always null after a round-trip.
MultiTurnSynthesizer no longer NPEs on missing min_turns/max_turns — an LLM can omit these fields even when the schema marks them required. Values are now treated as optional, matching the Python SDK.

Test plan

BaseSynthesizerTest — null-turns NPE regression + numeric-turns positive case + expected_response/language_code top-level serialization (issue expected_response nested under prompt.metadata instead of directly on prompt, causing test evaluation to fail #3)
EntityTest — JSON-tree regression guard for top-level Prompt fields; alias unit tests for both metadata and test_metadata deserialization paths; NON_NULL serialization check
ClientWiremockTest — WireMock guard asserting expected_response/language_code appear at $.tests[0].prompt.* in outbound POST /test_sets/bulk (not under metadata)
PromptRoundTripIntegrationTest — live backend round-trip verifying expected_response survives POST /test_sets/bulk → GET /test_sets/{id}/tests (the exact scenario from issue expected_response nested under prompt.metadata instead of directly on prompt, causing test evaluation to fail #3)
TestSetRoundTripIntegrationTest — live backend round-trips for multi-turn test_configuration fields and test_metadata → metadata alias

All 55 unit tests + 5 live-backend integration tests pass.

Fixes #3 and two related bugs discovered during the audit. - Prompt: add `expectedResponse` and `languageCode` as direct top-level fields (JSON: `expected_response`, `language_code`) so they are sent to the backend as siblings of `content` rather than nested under `metadata`. Previously the backend silently dropped them, leaving no expected response on the platform. - Prompt: remove `role` field — it was never part of the backend API or Python SDK. - Prompt: annotate with `@JsonInclude(NON_NULL)` so unset fields are omitted. - Test: add `@JsonAlias("test_metadata")` on `metadata` so it deserialises from the backend's GET response key (`test_metadata`) that differs from the POST key (`metadata`). Previously `test.metadata()` was always null after a round-trip. - MultiTurnSynthesizer: null-guard `min_turns`/`max_turns` before casting to int. An LLM can omit these fields even when the schema marks them required, causing a NullPointerException. Values are now treated as optional, matching the Python SDK. Tests added: - BaseSynthesizerTest: null-turns NPE regression + numeric-turns positive case - ClientWiremockTest: WireMock guard asserting `expected_response`/`language_code` appear at `$.tests[0].prompt.*` in outbound POST /test_sets/bulk requests - EntityTest: JSON-tree regression guard for top-level Prompt fields; alias tests for both `metadata` and `test_metadata` deserialization paths - PromptRoundTripIntegrationTest: live backend round-trip for `expected_response` - TestSetRoundTripIntegrationTest: live backend round-trips for multi-turn `test_configuration` fields and `test_metadata` → `metadata` alias

harry-rhesis merged commit 3379b7b into main Apr 20, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: correct Prompt serialization, Test.metadata alias, and multi-turn NPE#5

fix: correct Prompt serialization, Test.metadata alias, and multi-turn NPE#5
harry-rhesis merged 1 commit into
mainfrom
fix/test-metadata

harry-rhesis commented Apr 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

harry-rhesis commented Apr 20, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant