add LLMBackendTrafficPolicy #35

wengyao04 · 2024-12-06T18:29:17Z

add LLMBackendTrafficPolicy, which controls the flow of traffic to the backend.
add ratelimit in LLMBackendTrafficPolicy

Signed-off-by: yweng14 <[email protected]>

mathetake

do you want to add tests in https://github.com/envoyproxy/ai-gateway/tree/main/tests/cel-validation
could you edit the PR description so that we can have more context and summary. e.g. how they are used and how do you envision them to be used to add other features. That ways reviewers can get to know the big picture before deciphering the code

api/v1alpha1/api.go

mathetake · 2024-12-06T19:11:27Z

api/v1alpha1/api.go

+// LLMBackendTrafficPolicy controls the flow of traffic to the backend.
+type LLMBackendTrafficPolicy struct {


could you add a bit more documentation here like for example this is used to setup rate limit etc.

Signed-off-by: yweng14 <[email protected]>

yuzisun · 2024-12-06T20:52:10Z

api/v1alpha1/api.go

+}
+
+// LLMPolicyRateLimitHeaderMatch defines the match attributes within the HTTP Headers of the request.
+type LLMPolicyRateLimitHeaderMatch struct {


Can we reuse the generic envoy gateway headerMatch type?

https://github.com/envoyproxy/gateway/blob/14fb56e9b01da0aa333945be4caafb35cf2b7fbb/api/v1alpha1/ratelimit_types.go#L142

yeah, I would like to reuse EG native type as much as possible

mathetake · 2024-12-06T20:53:38Z

sorry the CI has a bug #36 this will fix it 🙏

…imit

yuzisun · 2024-12-06T21:20:27Z

api/v1alpha1/api.go

+type LLMTrafficPolicyRateLimitRule struct {
+	// Headers is a list of request headers to match. Multiple header values are ANDed together,
+	// meaning, a request MUST match all the specified headers.
+	// At least one of headers or sourceCIDR condition must be specified.


it is not matching by sourceCIDR here, we can also document the canonical header such as x-ai-gateway-llm-model-name used to apply the rate limiting.

aabchoo · 2024-12-10T16:43:05Z

api/v1alpha1/api.go

+	// BackendRefs lists the LLMBackends that this traffic policy will apply
+	// The namespace is "local", i.e. the same namespace as the LLMRoute.
+	//
+	BackendRef LLMBackendLocalRef `json:"backendRef,omitempty"`


The description states "backendrefs lists the llmbackends" which implies that this variable should be updated to:

BackendRefs []LLMBackendLocalRef

Do we want a one (traffic policy) to many (backends) relationship? I think it makes sense to have that in the case where we have very similar models that we want to have the same rules for

wengyao04 force-pushed the trafficpolicy-ratelimit branch 2 times, most recently from 6b4acf3 to 27094a5 Compare December 6, 2024 18:57

add LLMBackendTrafficPolicy

309eaab

Signed-off-by: yweng14 <[email protected]>

wengyao04 force-pushed the trafficpolicy-ratelimit branch from 27094a5 to 309eaab Compare December 6, 2024 18:58

fix lint

f00f31b

Signed-off-by: yweng14 <[email protected]>

wengyao04 marked this pull request as ready for review December 6, 2024 19:04

wengyao04 requested review from aabchoo, mathetake and missBerg as code owners December 6, 2024 19:04

wengyao04 assigned zirain Dec 6, 2024

mathetake reviewed Dec 6, 2024

View reviewed changes

mathetake requested a review from a team December 6, 2024 19:13

mathetake mentioned this pull request Dec 6, 2024

Define Control Plane API v0.1.0 #13

Open

2 tasks

add cel test and address comments

3deaf80

Signed-off-by: yweng14 <[email protected]>

wengyao04 force-pushed the trafficpolicy-ratelimit branch from 46d1d38 to 3deaf80 Compare December 6, 2024 20:44

yuzisun reviewed Dec 6, 2024

View reviewed changes

Merge remote-tracking branch 'upstream/main' into trafficpolicy-ratel…

0ba6ba9

…imit

yuzisun reviewed Dec 6, 2024

View reviewed changes

aabchoo reviewed Dec 10, 2024

View reviewed changes

mathetake requested a review from arkodg December 11, 2024 18:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add LLMBackendTrafficPolicy #35

add LLMBackendTrafficPolicy #35

wengyao04 commented Dec 6, 2024

mathetake left a comment

mathetake Dec 6, 2024

yuzisun Dec 6, 2024

mathetake Dec 11, 2024

mathetake commented Dec 6, 2024

yuzisun Dec 6, 2024

aabchoo Dec 10, 2024 •

edited

Loading

		// LLMBackendTrafficPolicy controls the flow of traffic to the backend.
		type LLMBackendTrafficPolicy struct {

add LLMBackendTrafficPolicy #35

Are you sure you want to change the base?

add LLMBackendTrafficPolicy #35

Conversation

wengyao04 commented Dec 6, 2024

mathetake left a comment

Choose a reason for hiding this comment

mathetake Dec 6, 2024

Choose a reason for hiding this comment

yuzisun Dec 6, 2024

Choose a reason for hiding this comment

mathetake Dec 11, 2024

Choose a reason for hiding this comment

mathetake commented Dec 6, 2024

yuzisun Dec 6, 2024

Choose a reason for hiding this comment

aabchoo Dec 10, 2024 • edited Loading

Choose a reason for hiding this comment

aabchoo Dec 10, 2024 •

edited

Loading