KubedirectorCluster Connections #283

Merged: 82 commits, Apr 22, 2020

Conversation

@kmathur2 (Member) commented Mar 11, 2020

What are Connections?

Currently, there are two types of connections:

Cluster connections
Configmap connections

Cluster connections - When we create a kubedirectorcluster, we generate "configmeta" as part of cluster creation; this holds all of the cluster-related metadata. Tools like configcli and bdvcli can query this metadata from within a kubedirectorcluster pod. This connection feature covers the case where a pod in one cluster wants to query metadata from another cluster. The kubedirectorapp can define the allowed attachment conditions (e.g. cluster category, distro_id, etc.) that determine which running clusters may be connected to the current cluster. When creating a new kubedirectorcluster, the user can list the kubedirectorclusters to be connected in this cluster's spec. The kdcluster reconciler will then dump each attached cluster's configmeta into the new cluster, where it can be queried using configcli from within the current cluster's pods.

Configmap connections - A list of configmaps (model, external auth, Kerberos, etc.) can also be connected as part of the current cluster spec. A connected configmap is propagated into the running pods of the parent kdcluster and can then be queried using bdvcli or similar tools.

Example of connections at play, as part of the kdcluster spec:

spec:
  connections:
    configmaps:
    - "model-prediction"
    - "noise-recognition"
    clusters:
    - "spark-logs-nightly"
    - "kafka-streaming"

Change Summary:
In the KubeDirectorAppSpec struct, added one new field:

  • ConnectableTo: This enumerates a list of allowed categories for connectable clusters

In the KubeDirectorClusterSpec struct, added two new fields:

  • ConfigMetaGenerator: bumped every time a connection is added or deleted
  • Connections: a new struct carrying the lists of config maps and clusters to be connected

In the KubeDirectorClusterStatus struct, added one new field:

  • LastConfigMetaGenerator: after every reconfig, this captures the last handled value of
    ConfigMetaGenerator (a sketch of these new fields follows below)
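
To make the shape of these additions concrete, here is a hedged sketch of the new fields in Go. The field names come from the summary above, but the package name, exact types, JSON tags, and the layout of the Connections struct are assumptions, not the merged API:

package v1beta2 // hypothetical package name for this sketch

// Connections lists the config maps and clusters to connect to a kdcluster.
type Connections struct {
    Clusters   []string `json:"clusters,omitempty"`
    ConfigMaps []string `json:"configmaps,omitempty"`
}

// KubeDirectorAppSpec (excerpt): ConnectableTo enumerates the allowed
// categories for connectable clusters.
type KubeDirectorAppSpec struct {
    // ... existing fields ...
    ConnectableTo []string `json:"connectableTo,omitempty"`
}

// KubeDirectorClusterSpec (excerpt): ConfigMetaGenerator is bumped every
// time a connection is added or deleted.
type KubeDirectorClusterSpec struct {
    // ... existing fields ...
    ConfigMetaGenerator int64       `json:"configMetaGenerator,omitempty"`
    Connections         Connections `json:"connections,omitempty"`
}

// KubeDirectorClusterStatus (excerpt): LastConfigMetaGenerator captures the
// generator value handled by the most recent reconfig.
type KubeDirectorClusterStatus struct {
    // ... existing fields ...
    LastConfigMetaGenerator int64 `json:"lastConfigMetaGenerator,omitempty"`
}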

Added a reconciler for config maps that watches for changes to any connected config map in the cluster namespace.
Every time a connected configmap or cluster is updated in the spec, configMetaGenerator is incremented; in other words, configMetaGenerator is bumped on every connection add or remove.
The cluster reconciler periodically compares the current value of configMetaGenerator against lastConfigMetaGenerator; if the value has changed, it triggers regeneration of configmeta and recreates the configmeta.json file for all pods in the running cluster, as sketched below.
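
A minimal sketch of that comparison, assuming the field sketch above, a KubeDirectorCluster wrapper type, and hypothetical helper names (syncConnections, updateConfigMetaForPods); the actual handler structure in the PR may differ:

// Sketch: regenerate configmeta only when a connection change has been observed.
func syncConnections(cr *KubeDirectorCluster) error {
    if cr.Spec.ConfigMetaGenerator == cr.Status.LastConfigMetaGenerator {
        // No connection change since the last configmeta regeneration.
        return nil
    }
    // Regenerate configmeta and rewrite configmeta.json in every running pod.
    if err := updateConfigMetaForPods(cr); err != nil {
        return err
    }
    // Record that this generation has now been handled.
    cr.Status.LastConfigMetaGenerator = cr.Spec.ConfigMetaGenerator
    return nil
}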

Follow-up tasks

  • Need to add guestconfig hooks for configmap updates
  • Finalize the connectable_to property
  • Reconciler refactoring using an Enqueue function
  • Hash-based approach instead of the configMetaGenerator counter
  • Settle whether a configmap should even be a connection, or whether it could instead be mounted like a secret (or vice versa)
  • Finalize the v1Beta2 API

@riteshja (Member) left a comment

None of the example app CRs are using the attachable_to property yet in this PR. Right?

@joel-bluedata (Member)

The kdcluster CRD needs to be updated with the new spec and status properties.

//Notify cluster by incrementing configmetaGenerator
wait := time.Second
maxWait := 4096 * time.Second
for {

The process of

  • read cluster object
  • increment the observed gen number in spec
  • write cluster object

needs to be inside this for loop. That's because the most likely reason for this update to fail is a conflict with some other write... so we will need to read the cluster object again to get a copy that includes the new updated resource version.

Also, if we want to guarantee that we actually are bumping the spec gen by 1, that's another reason to have the whole read-modify-write process inside the loop here.

As a side note: you COULD sidestep this looping by using PATCH, now that we have moved to a new version of the SDK that supports PATCH. However, if you don't want to dig into that right now I don't blame you. I think it's more straightforward at the moment just to move the read-modify-write inside the for loop.
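
For illustration, a sketch of the suggested shape: the read moves inside the loop so a retry after a conflict picks up the object's latest resource version. The read helper (shared.Get here), the kdv1 type path, the clusterName argument, and the error handling are assumptions layered on the snippet quoted above, not the code as merged:

// Sketch: read-modify-write retried on write conflict, with capped backoff.
wait := time.Second
maxWait := 4096 * time.Second
for {
    // Re-read the connected cluster so the update carries its latest
    // resource version (a conflict means someone else wrote it first).
    updateMetaGenerator := &kdv1.KubeDirectorCluster{}
    if err := shared.Get(context.TODO(), clusterName, updateMetaGenerator); err != nil {
        return err
    }
    // Increment inside the loop so the spec gen is bumped by exactly 1
    // relative to whatever was just read.
    updateMetaGenerator.Spec.ConfigMetaGenerator++
    if shared.Update(context.TODO(), updateMetaGenerator) == nil {
        break
    }
    if wait > maxWait {
        return fmt.Errorf("unable to notify cluster of connection change")
    }
    time.Sleep(wait)
    wait *= 2
}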

if shared.Update(context.TODO(), updateMetaGenerator) == nil {
break
}
if wait > maxWait {

Similar comment about the for loop here and read-modify-write.

@joel-bluedata (Member)

Done w/ review pass. We're in the home stretch!

@kmathur2 (comment minimized)

@joel-bluedata (Member)

Ah I see. Sorry! Still, it might be good to have it say which connection changed, which might be easier to do in the other spot.

BTW typo "chamge" there.

@kmathur2 (comment minimized)

@kmathur2 (Member, Author)

The kdcluster CRD needs to be updated with the new spec and status properties.

Won't this go in v1Beta2 CRD when we finalize that?

@joel-bluedata (Member)

Eh, yeah. IMO it's nice to express it here too though, just so this whole change is self-consistent. Not a big deal if you'd rather wait (assuming we get the v1beta2 change in soon).

@joel-bluedata (Member)

We are doing in both places

We're posting an event against foo that says foo is affecting bar, and we're posting an event against bar that says "I'm being affected by... something". What I'm suggesting is that the event on bar should ideally say that foo caused the change.
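
For example, a hedged sketch of posting such an event with the client-go EventRecorder interface (the recorder variable, the affectedCluster/connectedCluster objects, and the wording are assumptions, not the helper the PR actually uses):

// Sketch: post the event against the affected cluster ("bar") and name the
// connected cluster ("foo") whose change triggered the reconfig.
eventRecorder.Eventf(
    affectedCluster,
    corev1.EventTypeNormal,
    "Connections",
    "connected cluster {%s} changed, regenerating configmeta",
    connectedCluster.Name,
)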

@kmathur2 (comment minimized)

@joel-bluedata (Member)

That's why I suggested posting that event in the other code.

@kmathur2 (Member, Author)

That's why I suggested posting that event in the other code.

At the other place (foo) we already have "configmap {%s} is a connection to cluster {%s}, updating its configmeta" - does this not suffice?! I am so confused

@joel-bluedata (Member)

Taking event-posting-chat to Slack for a bit. :-)

@joel-bluedata (Member)

Once those pesky events are sorted I think we're good to go. I'll do a test build before merging to master... let me know if you have some example cluster CRs I can play with.

@joel-bluedata (Member)

The comment above about event posting in configmap.go is still unresolved FYI.

@joel-bluedata joel-bluedata merged commit 0ff14df into bluek8s:master Apr 22, 2020