What Siamese Dreams are made of…

In my last post I wrote a high-level description of a One-Shot learning approach we developed for telecommunication network fault identification through traffic analysis. The One-Shot learning approach is implemented using a Siamese Deep Neural Network. In this post I will describe in more detail how this can be achieved with the use of Keras and TensorFlow. As I said in the previous post, this is early work and subject to a lot of change, but if it can help someone else alleviate some of the pain of building such a network, let it be!

The first step is probably to understand what a Siamese network is and how it works. What we want our network to produce is a representation of the data we feed it, i.e. a vector representing the input data, like word embeddings, but in this case for telecom network traffic data. At the end of the day, this representation vector should have small distances for similar traffic and larger distances for dissimilar traffic. Hence, when the network is properly trained we can use those distances to determine which known network traffic is the closest and thus the most representative. But how do we implement it?
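
To make the idea concrete, here is a toy sketch (pure NumPy, with made-up three-dimensional vectors) of the property we want from such a representation:

import numpy as np

# Made-up encodings: two similar traffic captures and one dissimilar capture
enc_normal_a = np.array([0.9, 0.1, 0.0])
enc_normal_b = np.array([0.8, 0.2, 0.1])
enc_faulty = np.array([0.1, 0.9, 0.7])

# Similar traffic should yield a small L2 distance, dissimilar a large one
print(np.linalg.norm(enc_normal_a - enc_normal_b))  # ~0.17
print(np.linalg.norm(enc_normal_a - enc_faulty))    # ~1.33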

For that, let’s look at the cute kitten image I have put on this and the previous post. The cute crème-colored one hiding at the bottom is Aristotle. The other crème-colored one is Peter Pan and the black one is Napoleon. Aristotle is our Anchor, the kitten we want to compare to. If another kitten is similar, let’s say Peter Pan, then the vector representing Peter Pan should be close in distance to the vector representing Aristotle. This is our Positive example. Similarly, when a kitten is different from Aristotle, let’s say Napoleon, we want the vector representing it to be far in distance from Aristotle. This is our Negative example.

Simplifying things, training a deep neural network consists of predicting a result from a training example; finding out how far we are from the expected value using a loss function; and then correcting the weights of the deep neural network based on that error, so next time we are a bit closer. Here we do not know the expected value for our training examples, but we know that whatever that value is, it should be close in distance to the Anchor if we present the Positive example, and far in distance if we present the Negative example. Thus, we will build our loss function in that way. It receives the encodings of the Anchor, the Positive example and the Negative example, concatenated together, through y_pred. Then it computes the distance between the Anchor and the Positive (AP), and between the Anchor and the Negative (AN). As we said, AP should get close to 0 while AN should get large. For this exercise, let’s set “large” to mean at least 0.2 away, i.e. we want AP = 0 and AN - 0.2 >= 0. Ideally, we want both of those to hold at once, hence we minimize the loss where loss = AP - (AN - 0.2) = AP - AN + 0.2, clamped at zero so that triplets which already satisfy the margin contribute nothing. That being explained, below is the loss function we defined.


import tensorflow as tf


def triplet_loss(y_true, y_pred, alpha=0.2):
    """
    Implementation of the triplet loss function.

    Arguments:
    y_true -- true labels, required when you define a loss in Keras, not used in this function.
    y_pred -- tensor of shape (batch, 3 * encoding_size), the concatenation of:
            anchor:   the encodings for the anchor data
            positive: the encodings for the positive data (similar to anchor)
            negative: the encodings for the negative data (different from anchor)

    Returns:
    loss -- real number, value of the loss
    """
    # Split the merged vector back into the three encodings
    encoding_size = int(y_pred.shape[-1]) // 3
    anchor = y_pred[:, :encoding_size]
    positive = y_pred[:, encoding_size:2 * encoding_size]
    negative = y_pred[:, 2 * encoding_size:]
    # squared L2 distance between the anchor and the positive, per example
    pos_dist = tf.reduce_sum(tf.square(anchor - positive), axis=-1)
    # squared L2 distance between the anchor and the negative, per example
    neg_dist = tf.reduce_sum(tf.square(anchor - negative), axis=-1)
    # hinge on the margin: only triplets violating AN > AP + alpha contribute
    basic_loss = pos_dist - neg_dist + alpha
    loss = tf.reduce_mean(tf.maximum(basic_loss, 0.0))
    return loss
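
As a quick smoke test of that loss, here is a minimal sketch (assuming TensorFlow 2.x eager execution and made-up encodings):

import numpy as np

# One triplet of made-up 2-dimensional encodings, concatenated like merged_vector
anchor = np.array([[1.0, 0.0]], dtype='float32')
positive = np.array([[0.9, 0.1]], dtype='float32')  # close to the anchor
negative = np.array([[0.0, 1.0]], dtype='float32')  # far from the anchor
y_pred = np.concatenate([anchor, positive, negative], axis=-1)

print(triplet_loss(None, y_pred).numpy())  # 0.0, since the 0.2 margin is already satisfied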


Now that we have a loss function to train with, we need to define the network itself. The network should receive our network traffic information as input and output a vector representation of it. I already described the network before, so here is the function that creates it using the Keras Sequential model.


from keras.models import Sequential
from keras.layers import BatchNormalization, Dense, LSTM


def create_base_network(in_dims, out_dims):
    """
    Base network to be shared.
    """
    model = Sequential()
    model.add(BatchNormalization(input_shape=in_dims))
    model.add(LSTM(512, return_sequences=True, dropout=0.2, recurrent_dropout=0.2, implementation=2))
    model.add(LSTM(512, return_sequences=False, dropout=0.2, recurrent_dropout=0.2, implementation=2))
    model.add(BatchNormalization())
    model.add(Dense(512, activation='relu'))
    model.add(BatchNormalization())
    model.add(Dense(out_dims, activation='relu'))
    model.add(BatchNormalization())
    return model
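
For instance, with hypothetical dimensions (60 minutes of traffic, 130 features per minute, and a 32-dimensional encoding), a quick sanity check of the output shape could look like this:

base = create_base_network((60, 130), 32)
base.summary()  # the last layer should report an output shape of (None, 32)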


Now that we have that base model, we need to embed it within a Siamese “framework”. After all, that base network simply computes one vector representation for a specific network traffic sample, and the loss function we defined calls for three of those representations, i.e. the anchor, the positive and the negative. So, what we will do is define three inputs which will be evaluated through the SAME base network, hence the name Siamese network. The outputs of that Siamese network are then simply concatenated into a single merged vector, which is what we ask our loss function to evaluate. Note that at this point we define the input and output dimensions. The inputs will be in the shape of N_MINS minutes of network traffic characterization (60 minutes for now), where each minute is characterized by n_feat features (the 130 or so features I mentioned in my previous post).


import tensorflow as tf
from keras.layers import Input, concatenate
from keras.models import Model
from keras.optimizers import Adam

in_dims = (N_MINS, n_feat)
out_dims = N_FACTORS
# Network definition
with tf.device(tf_device):
    # Create the 3 inputs
    anchor_in = Input(shape=in_dims)
    pos_in = Input(shape=in_dims)
    neg_in = Input(shape=in_dims)
    # Share the base network with the 3 inputs
    base_network = create_base_network(in_dims, out_dims)
    anchor_out = base_network(anchor_in)
    pos_out = base_network(pos_in)
    neg_out = base_network(neg_in)
    merged_vector = concatenate([anchor_out, pos_out, neg_out], axis=-1)
    # Define the trainable model
    model = Model(inputs=[anchor_in, pos_in, neg_in], outputs=merged_vector)
    model.compile(optimizer=Adam(),
                  loss=triplet_loss)
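
Optionally, a quick look at the model summary can confirm the wiring: three input layers feeding a single shared base network, whose three output encodings are concatenated into one merged vector.

model.summary()  # expect three Input layers, one shared sequential model, and a Concatenate layer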

Everything is now in place to train the base model through the Siamese “framework” using our defined loss function. Note that the y values we pass to the fit method are dummy values, since our loss function does not care for the real targets (which we do not know).


# Training the model; train_data is the list [anchors, positives, negatives]
# and y_dummie is ignored by triplet_loss (only its shape matters to Keras)
model.fit(train_data, y_dummie, batch_size=256, epochs=10)
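
In case it helps, here is a hedged sketch of what train_data and y_dummie could look like; anchors, positives and negatives are hypothetical NumPy arrays of shape (n_samples, N_MINS, n_feat) coming from whatever triplet mining you perform on your own data:

import numpy as np

# The three inputs of the Siamese model, in the same order as its Input layers
train_data = [anchors, positives, negatives]
# Any target with the right shape works, since triplet_loss ignores y_true
y_dummie = np.zeros((anchors.shape[0], 3 * N_FACTORS))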


Now we could save the model (really, just the base model is needed here). But more importantly, we can use the base model to compute the vector representation of some traffic. For me, this was the part that was unclear from other tutorials: you simply perform a predict on the base model and do not care anymore about the Siamese “framework”. You kind of throw it away.


import numpy as np


def traffic_to_encoding(x, model):
    return model.predict(np.array([x]))
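
For example, a quick sanity check (with hypothetical anchor_sample, positive_sample and negative_sample drawn from your own data) can verify that similar traffic indeed lands closer than dissimilar traffic, and saving only the base model is straightforward:

enc_a = traffic_to_encoding(anchor_sample, base_network)
enc_p = traffic_to_encoding(positive_sample, base_network)
enc_n = traffic_to_encoding(negative_sample, base_network)
print(np.linalg.norm(enc_a - enc_p))  # expected: small
print(np.linalg.norm(enc_a - enc_n))  # expected: noticeably larger
base_network.save('base_network.h5')  # hypothetical filename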


For completeness’ sake, since what we want to do is find the “closest” vector representation among the trained faults we want to detect, we can create a method such as the following to identify the traffic case.


def identify_traffic(x, database, model):
    """
    Implements traffic recognition.

    Arguments:
    x -- the traffic to identify
    database -- database containing recognized traffic encodings
    model -- the encoding model

    Returns:
    min_dist -- the minimum distance between the traffic encoding and the encodings from the database
    identity -- string, the traffic prediction name
    """
    # Compute the target "encoding" for the traffic.
    encoding = traffic_to_encoding(x, model)
    # Find the closest encoding
    min_dist = 100
    identity = 'unknown'
    for (name, db_enc) in database.items():
        # Compute the L2 distance between the target "encoding" and the current "db_enc" from the database.
        dist = np.linalg.norm(db_enc - encoding)
        # If this distance is less than min_dist, then set min_dist to dist, and identity to name.
        if dist < min_dist:
            min_dist = dist
            identity = name
    return min_dist, identity


Assuming proper training of our Siamese network on our training data, we can use the above to create a database of the different traffic conditions we can identify in a specific network (as traffic patterns can change from network to network, but hopefully not the way to represent them), and then identify the current traffic using the function we just created.


# Build the reference database of known traffic conditions
database = {}
database['normal'] = traffic_to_encoding(get_example_label(train_cases_df, df_lens, 0), base_network)
database['error2'] = traffic_to_encoding(get_example_label(train_cases_df, df_lens, 1), base_network)
# Prediction on traffic
min_dist, identity = identify_traffic(x, database, base_network)


Et voilà, you should now have all the pieces to properly use Aristotle, Peter Pan and Napoleon to train a Siamese network, and then sadly throw them away when you do not need them anymore… This metaphor of Siamese cats is heartbreakingly getting closer and closer to reality… Nevertheless, I hope it can help you out there creating all sorts of Siamese networks!

5 thoughts on “What Siamese Dreams are made of…”

  1. Very cool application and solution. My question is: how is the merged_vector passed to the triplet_loss function?
    I see the output of the model is merged_vector, and in the compile loss=triplet_loss, but how do the merged_vector values get into triplet_loss? Thanks, Jon

    # Define the trainable model
    model = Model(inputs=[anchor_in, pos_in, neg_in], outputs=merged_vector)
    model.compile(optimizer=Adam(),
    loss=triplet_loss)

  2. So first, take a look at my answer on the “How to potty train a Siamese network” post. The loss function here was buggy… that can explain the difficulty in understanding it 🙂 . Next, we don’t use the y_true in this version as we define the loss only on the y_pred, in fact on pairs of y_pred. So we need to split the y_pred into the components anchor, positive and negative, and then we want to maximize the distance between anchor and negative while minimizing the distance from anchor to positive. The model compile defines the usage of that loss function, and model.fit will pass the example predictions to the loss function in batches; that is Keras mechanics.

  3. Hi, I am trying to train a Siamese network for signature verification. When I use loss functions which find the distance between image encodings (e.g. triplet loss and contrastive loss), the model learns to encode every image into a vector of zeros, and I don’t know why.

    But when I use ‘binary_crossentropy’ it works fine.
